





Fundamentals of — 


T 
ommunication © 


t N 
i) 
1 N 

ji ® 
| 

i! | s 

i | 






























































ISBN-13: 978-0-13-335485-0 
ISBN-10: 0-13-335485-7 











90000 > 
> 
=m 
9 17801331354850 





www.pearsonhighered.com 





FUNDAMENTALS 
OF COMMUNICATION SYSTEMS 


Second Edition 


i 
John G. Proakis | 
Masoud Salehi | | 


PEARSON 


Boston Columbus Indianapolis New York San Francisco Upper Saddle River 
Amsterdam Cape Town Dubai London Madrid Milan Munich Paris Montréal Toronto 
Delhi Mexico City Sao Paulo Sydney HongKong Seoul Singapore Taipei Tokyo 


Vice President and Editorial Director, ECS: Marcia J. Horton 
Executive Editor: Andrew Gilfillan 

Editorial Assistant: William Opaluch 

Marketing Manager: Tim Galtigan 

Production Manager: Tom Benfatti 

Art Director: Jayne Conte 

Cover Designer: Bruce Kensedaar 

Full-Service Project Management/Composition: Integra 
Printer/Binder: Courier Westford 

Cover Printer: Courier Westford 


Credits and acknowledgments borrowed from other sources and reproduced, with permission, in this textbook 
appear on appropriate page within text. 


Copyright © 2014, 2005 by Pearson Education, Inc., publishing as Prentice Hall, 1 Lake Street, Upper Saddle 
River, NJ 07458. 


All rights reserved. Manufactured in the United States of America. This publication is protected by Copyright, 
and permission should be obtained from the publisher prior to any prohibited reproduction, storage in a retrieval 
system, or transmission in any form or by any means, electronic, mechanical, photocopying, recording, or 
likewise, To obtain permission(s) to use material from this work, please submit a written request to Pearson 
Education, Inc., Permissions Department, imprint permissions address. 


Many of the designations by manufacturers and seller to distinguish their products are claimed as trademarks. 
Where those designations appear in this book, and the publisher was aware of a trademark claim, the 
designations have been printed in initial caps or all caps. 


MATLAB is a registered trademark of The Math Works, Inc., 3 Apple Hill Drive, Natick, MA 01760-2098 


The author and publisher of this book have used their best efforts in preparing this book. These efforts include 
the development, research, and testing of the theories and programs to determine their effectiveness. The author 
and publisher make no warranty of any kind, expressed or implied, with regard to these programs or the 
documentation contained in this book. The author and publisher shall not be liable in any event for incidental or 
consequential damages in connection with, or arising out of, the furnishing, performance, or use of these 
programs. 


Library of Congress Cataloging-in-Publication Data 
Proakis, John G. 
Fundamentals of communication systems / John G. Proakis, Masoud Salehi. — Second edition. 
pages cm 
Includes bibliographical references. 
ISBN-13: 978-0-13-335485-0 
ISBN-1I0: 0-13-335485-7 
. Telecommunication systems. 2. Telecommunication. I. Salehi, Masoud. I1. Title. 
TKS101.P755 2014 
621.382-—de23 


— 


2013003293 


10 98765432 1 


PEARSON ISBN-10:  0-13-3354857 
www.pearsonhighered.com ISBN-13: 978-0-13-3354850 


To Felia, George, and Elena. 
—John G. Proakis 

To Fariba, Omid, Sina, and my parents. 
-Masoud Salehi 


Contents 


1 


PREFACE 
INTRODUCTION 
1.1 Historical Review | 
1.2 Elements of an Electrical Communication System 4 
1.2.1 Digital Communication System, 7 
1.2.2 Early Work in Digital Communications, 10 
1.3 Communication Channels and Their Characteristics 
1.4 Mathematical Models for Communication Channels 
1.5 Summary and Further Reading 20 


SIGNALS AND LINEAR SYSTEMS 


2.1 


2:2 


2.3 


2.4 


2.5 


2.6 
2.7 


Basic Concepts 21 

2.1.2 Basic Operations on Signals, 21 

2.1.2 Classification of Signals, 23 

2.1.3 Some dmportant Signals and Their Properties, 31 
2.1.4 Classification of Systems, 38 

2.1.5 Analysis of LTI Systems in the Time Domain, 41 


Fourier Series 43 
2.2.1 Fourier Series and Its Properties, 44 


2.2.2 Response of LTI Systems to Periodic Signals, 54 


2.2.3 Parseval’s Relation, 56 


Fourier Transform 58 

2.3.1 From Fourier Series to Fourier Transforms, 58 
2.3.2 Basie Properties of the Fourier Transform, 64 
2.3.3 Fourier Transform for Periodic Signals, 78 
2.3.4 Transmission over LTI Systems, 81 


Filter Design 85 


Power and Energy 89 
2.5.1 Energy-Type Signals, 89 
2.5.2 Power-Type Signals, 92 


Hilbert Transform and Its Properties 95 


Lowpass and Bandpass Signals 98 


12 
18 


xvii 


21 


vii 


viii 


Contents 


2.8 Summary and Further Reading 100 
Problems 101 


AMPLITUDE MODULATION 117 


3.1 Introduction to Modulation 118 


3.2 Amplitude Modulation 119 
3.2.1. Double-Sideband Suppressed-Carrier AM, 119 
3.2.2 Conventional Amplitude Modulation, 126 
3.2.3 Single-Sideband AM, 132 
3.2.4 Vestigial-Sideband AM, 134 


3.3 Implementation of Amplitude Modulators and Demodulators 137 


3.4 Signal Multiplexing 144 
3.4.1. Frequency-Division Multiplexing, 144 
3.4.2 Quadrature-Carrier Multiplexing, 145 


3.5 AM Radio Broadcasting 146 

3.6 Summary and Further Reading 149 
Appendix 3A: Derivation of the Expression forSSB-AM Signals 149 
Problems 151 


ANGLE MODULATION 167 


4.1 Representation of FM and PM Signals 161 


4.2 Spectral Characteristics of Angle-Modulated Signals 166 
4.2.1 Angle Modulation by a Sinusoidal Signal, 166 
4.2.2 Angle Modulation by an Arbitrary Message Signal, 170 


4.3 Implementation of Angle Modulators and Demodulators 171 
4.4 FM Radio Broadcasting 179 
4.5 Summary and Further Reading 181 

Problems 182 


PROBABILITY AND RANDOM PROCESSES 190 


5.1 Review of Probability and Random Variables 190 
5.1.1 Sample Space, Events, and Probability, 190 
5.1.2 Conditional Probability, 191 
5.1.3 Random Variables, 194 
5.1.4 Functions of a Random Variable, 201 
5.1.5 Multiple Random Variables, 203 
5.1.6 Sums of Random Variables, 208 


Contents ix 


5.2 Random Processes: Basic Concepts 209 
5.2.1 Statistical Averages, 212 
5.2.2 Wide-Sense Stationary Processes, 215 
5.2.3 Multiple Random Processes, 217 
5.2.4 Random Processes and Linear Systems, 218 
5.2.5 Power Spectral Density of Stationary Processes, 220 
5.2.6 Power Spectral Density of a Sum Process, 225 


5.3 Gaussian and White Processes 226 
5.3.1 Gaussian Processes, 226 
5.3.2 White Processes, 228 
5.3.3 Filtered Noise Processes, 230 


5.4 Summary and Further Reading 235 
Problems 236 


6 EFFECT OF NOISE ON ANALOG COMMUNICATION SYSTEMS 255 


6.1 Effect of Noise on Amplitude Modulation Systems 255 
6.1.1 Effect of Noise on a Baseband System, 256 
6.1.2 Effect of Noise on DSB-SC AM, 256 
6.1.3 Effect of Noise on SSB AM, 258 
6.1 4 Effect of Noise on Conventional AM, 259 


6.2 Effect of Noise on Angle Modulation 263 
6.2.1 Threshold Effect in Angle Modulation, 271 
6.2.2 Preenphasis and Deemphasis Filtering for FM, 274 


6.3 Comparison of Analog-Modulation Systems 277 


6.4 Effects of Transmission Losses and Noise in Analog Communication 
Systems 278 
6.4.1 Characterization of Thermal Noise Sources, 279 
6.4.2 Effective Noise Temperature and Noise Figure, 280 
6.4.3 Transmission Losses, 283 
6.4.4 Repeaters for Signal Transmission, 284 


6.5 Summary and Further Reading 287 
Problems 288 


7 ANALOG-TO-DIGITAL CONVERSION 296 
7.1 Sampling of Signals and Signal Reconstruction from Samples 297 


7.1.1 The Sampling Theorem, 297 


7.2 Quantization 301 
7.2.1 Scalar Quantization, 302 
7.2.2 Vector Quantization, 309 


7.3 
7.4 


7.5 
7.6 


7.7 
7.8 


Contents 


Encoding 311 


Waveform Coding 312 

7.4.1 Pulse Code Modulation, 313 

7.4.2 Differential Pulse Code Modulation, 316 
7.4.3 Delta Modulation, 318 


Analysis—Synthesis Techniques 321 


Digital Audio Transmission and Digital Audio Recording 325 
7.6.1 Digital Audio in Telephone Transmission Systems, 325 
7.6.2 Digital Audio Recording, 327 


The JPEG Image-Coding Standard 332 
Summary and Further Reading 335 
Problems 336 


DIGITAL MODULATION METHODS IN AN ADDITIVE WHITE 
GAUSSIAN NOISE CHANNEL 


8.1 
8.2 


8.3 


8.4 


8.5 


8.6 


Geometric Representation of Signal Waveforms 348 


Binary Modulation Schemes 352 
8.2.1 Binary Antipodal Signaling, 352 
8.2.2 Binary Orthogonal Signaling, 356 


Optimum Receiver for Binary Modulated Signals in Additive White 
Gaussian Noise 361 

8.3.1 Correlation- Type Demodulator, 362 

8.3.2 Matched-Filter- Type Demodulator, 371 

8.3.3 The Performance of the Optimum Detector for Binary Signals, 379 


M-ary Digital Modulation 384 
8.4.1 The Optinuun Receiver for M-ary Signals in AWGN, 384 
8.4.2 A Union Bound on the Probability of Error, 396 


M-ary Pulse Amplitude Modulation 398 
8.5.1 Carrier-Modulated PAM for Bandpass Channels (M-ary ASK), 400 
8.5.2 Demodulation and Detection of Amplitude-Modulated 
PAM Signals, 403 
8.5.3 Probability of Error for M-ary PAM, 403 


Phase-Shift Keying 406 

8.6.1 Geometric Representation of PSK Signals, 408 

8.6.2 Demodulation and Detection of PSK Signals, 410 

8.6.3 Probability of Error for Phase-Coherent PSK Modulation, 411 

8.6.4 Differential Phase Encoding and Differential Phase Modulation 
and Demodulation, 416 

8.6.5 Probability of Error for DPSK, 418 


347 


Contents 


8.7 


8.8 


8.9 


8.10 
8.11 


9.1 


9.2 


9.3 


9.4 


9.5 


9.6 


Quadrature Amplitude-Modulated Digital Signals 419 
8.7.1 Geometric Representation of QAM Signals, 421 

8.7.2 Demodulation and Detection of QAM Signals, 423 
8.7.3 Probability of Error for QAM, 424 


Carrier-Phase Estimation 429 

8.8.1 The Phase-Locked Loop, 429 

8.8.2 The Costas Loop, 437 

8.8.3 Carrier-Phase Estimation for PAM, 439 
8.8.4 Carrier-Phase Estimation for PSK, 440 

8.8.5 Carrier-Phase Estimation for QAM, 444 


Symbol Synchronization 446 

8.9.1 Early—Late Gate Synchronizers, 447 

8.9.2 Minimum Mean Square Error Method, 450 

8.9.3. Maximum-Likelihood Method, 451 

8.9.4 Spectral-Line Method, 452 

8.9.5 Symbol Synchronization for Carrier-Modulated Signals, 455 


Regenerative Repeaters 456 
Summary and Further Reading 457 
Problems 459 


MULTIDIMENSIONAL DIGITAL MODULATION 


M-ary Orthogonal Signals 485 

9.1.1 Probability of Error for M-ary Orthogonal Signals, 488 

9.1.2 A Union Bound on the Error Probability of M-ary Orthogonal 
Signals, 49] 


Biorthogonal Signals 492 
9.2.1 Probability of Error for M-ary Biorthogonal Signals, 495 


Simplex Signals 497 
9.3.1 Probability of Error for M-ary Simplex Signals, 498 


Binary-Coded Signals 499 
9.4.1 Probability of Error for Binary -Coded Signals, 501 


Frequency-Shift Keying 501 

9.5.1 Demodulation of M-ary FSK, 503 

9.5.2 Optimuim Detector for Noncoherent Binary FSK, 507 

9.5.3 Probability of Error for Noncoherent Detection of M-ary FSK, 510 


Modulation Systems with Memory 513 
9.6.1 Continuous-Phase FSK, 513 
9.6.2 Spectral Characteristics of CPFSK Signals, 524 


xi 


485 


xii 


10 


11 


9.7 Comparison of Modulation Methods 525 
9.8 Summary and Further Reading 532 
Problems 533 


DIGITAL TRANSMISSION THROUGH BANDLIMITED AWGN CHANNELS 


Contents 


543 


10.1 


10.2 
10.3 


10.4 


10.5 


10.6 


Characterization of Bandlimited Channels and Signal Distortion 543 
10.1.1 Intersymbol Interference in Signal Transmission, 547 
10.1.2 Digital Transmission through Bandlimited Bandpass Channels, 549 


The Power Spectrum of Digitally Modulated Signals 552 


Signal Design for BandlimitedChannels 556 

10.3.1 Design of Bandlimited Signals for Zero [SI—The Nyquist 
Criterion, 558 

10.3.2 Design of Bandlimited Signals with Controlled [SI—Partial-Res ponse 
Signals, 564 


Detection of Partial-Response Signals 566 

10.4.1 Symbol-by-Symbol Detection, 567 

10.4.2 Probability of Error for Symbol-by-Symbol Detection, 570 

10.4.3 Maximum-Likelihood Sequence Detection of Partial-Res ponse 
Signals, 573 

10.4.4 Error Probability of the Maximum-Likelihood Sequence 
Detector, 576 


System Design in the Presence of Channel Distortion 577 

10.5.1 Design of Transmitting and Receiving Filters for a Known 
Channel, 578 

10.5.2 Channel Equalization, 582 


Summary and Further Reading 599 


Appendix 10A: Power Spectrum of Modulated Signals 601 
10A.I The Power Spectrum of the Baseband Signal, 601 
10A.2 The Power Spectrum of the Carrier Modulated Signals, 603 


Problems 604 


MULTICARRIER MODULATION AND OFDM 621 


Orthogonal Frequency-Division Multiplexing 621 
Modulation and Demodulation in an OFDM System 622 
An OFDM System Implemented via the FFT Algorithm 626 
Spectral Characteristics of OFDM Signals 629 


Contents xiii 
11.5 Peak-to-Average Power Ratio in OFDM Systems 631 
11.6 Applications of OFDM 633 

11.6.1 Digital Subscriber Lines, 633 

11.6.2 Wireless LANs, 635 

11.6.3 Digital Audio Broadcasting, 636 
11.7 Summary and Further Reading 636 

Problems 637 

12 AN INTRODUCTION TO INFORMATION THEORY 647 

12.1 Modeling Information Sources 642 
12.1.1 Measure of Information, 644 
12.1.2 Joint and Conditional Entropy, 647 
12.1.3 Mutual Information, 650 
12.1.4 Differential Entropy, 650 
12.2 The Source Coding Theorem 652 
12.3 Source Coding Algorithms 655 
12.3.1 The Huffman Source Coding Algorithm, 655 
12.3.2 The Lempel-Ziv Source Coding Algorithm, 659 
12.4 Modeling of Communication Channels 661 
12.5 Channel Capacity 664 
12.5.1. Gaussian Channel Capacity, 669 
12.6 Bounds on Communication 671 
12.7 Summary and Further Reading 674 
Problems 675 

13 CODING FOR RELIABLE COMMUNICATIONS 689 
13.1 The Promise of Coding 689 
13.2 Linear Block Codes 694 

13.2.1 Decoding and Performance of Linear Block Codes, 700 
13.2.2 Some Important Linear Block Codes, 707 
13.2.3 Error Detection versus Error Correction, 708 
13.2.4 Burst-Error-Correcting Codes, 709 
13.3 Convolutional Codes 711 


13.3.1 Basic Properties of Convolutional Codes, 712 

13.3.2 Maximum Likelihood Decoding of Convolutional Codes—The Viterbi 
Algorithm, 717 

13.3.3 Other Decoding Algorithms for Convolutional Codes, 722 

13.3.4 Bounds on the Error Probability of Convolutional Codes, 722 


xiv 


14 


13.4 


13.5 


13.6 


13.7 


13.8 


13.9 


Contents 


Good Codes Based on Combination of Simple Codes 725 
13.4.1 Product Codes, 727 
13.4.2 Concatenated Codes, 728 


Turbo Codes and Iterative Decoding 728 

13.5.1 MAP Decoding of Convolutional Codes—The BCJR Algorithm, 731 
13.5.2 Iterative Decoding for Turbo Codes, 737 

13.5.3 Performance of Turbo Codes, 739 


Low-Density Parity-Check Codes 741 
13.6.1 Decoding LDPC Codes, 745 


Coding for Bandwidth-Constrained Channels 747 
13.7.1 Combined Coding and Modulation, 748 
13.7.2 Trellis-Coded Modulation, 749 


Practical Applications of Coding 756 
13.8.1 Coding for Deep-Space Communications, 756 
13.8.2 Coding for Telephone-Line Modems, 758 


Summary and Further Reading 759 
Problems 760 


DATA TRANSMISSION IN FADING MULTIPATH CHANNELS 769 


14.1 
14.2 


14.3 


14.4 


14.5 
14.6 


Characterization of Physical Wireless Channels 769 


Channel Models for Time-Variant Multipath Channels 771 
14.2.1 Frequency Nonselective Fading Channel, 774 

14.2.2 Frequency Selective Fading Channel, 777 

14.2.3 Models for the Doppler Power Spectrum, 778 

14.2.4 Propagation Models for Mobile Radio Channels, 781 


Performance of Binary Modulation in Rayleigh Fading Channels 783 

14.3.1 Probability of Error in Frequency Nonselective Channels, 783 

14.3.2 Performance Improvement through Signal Diversity, 786 

14.3.3. The RAKE Demodulator and Its Performance in Frequency Selective 
Channels, 792 

14.3.4 OFDM Signals in Frequency Selective Channels, 794 


Multiple Antenna Systems 795 

14.4.1 Channel Models for Multiple Antenna Systems, 796 

14.4.2 Signal Transmission in a Slow Fading Frequency Nonselective MIMO 
Channel, 797 

14.4.3 Detection of Data Symbols in a MIMO System, 799 

14.4.4 Error Rate Performance of the Detectors, 800 

14.4.5 Space-Time Codes for MIMO Systems, 802 


Link Budget Analysis for Radio Channels 810 
Summary and Further Reading 813 
Problems 815 


Contents xv 
15  SPREAD-SPECTRUM COMMUNICATION SYSTEMS 825 


15.1 Model of a Spread-Spectrum Digital Communication System 826 


15.2 Direct Sequence Spread-Spectrum Systems 827 
15.2.1 Effect of Despreading on a Narrowband Interference, 830 
15.2.2 Probability of Error at the Detector, 831 
15.2.3 Performance of Coded Spread-Spectrum Signals, 836 


15.3 Some Applications of DS Spread-Spectrum Signals 836 
15.3.1 Low-Detectability Signal Transmission, 836 
15.3.2 Code Division Multiple Access, 837 
15.3.3) Communication over Channels with Multipath, 838 
15.3.4 Wireless LANs, 839 


15.4 Generation of PN Sequences 840 


15.5 Frequency-Hopped Spread Spectrum 843 
15.5.1 Slow Frequency-Hopping Systems and Partial-Band Interference, 844 
15.5.2 Fast Frequency Hopping, 847 
15.5.3 Applications of FH Spread Spectrum, 848 


15.6 Synchronization of Spread-Spectrum Systems 849 
15.6.1 Acquisition Phase, 849 
15.6.2 Tracking, 852 


g 
15.7 Digital Cellular Communication Systems 856 

15.7.1 The GSM System, 858 

15.7.2- CDMA System Based on IS-95, 862 

15.7.3 Third Generation Cellular Communication Systems and Beyond, 866 


15.8 Summary and Further Reading 868 
Problems 869 


REFERENCES 877 


INDEX 886 


Preface 


This book is intended as a senior-level undergraduate textbook on communication systems 
for Electrical Engineering majors. Its primary objective is to introduce the basic techniques 
used in modern communication systems and to provide fundamental tools and methodolo- 
gies used in the analysis and design of these systems. Although the book is mainly written 
as an undergraduate-level textbook, it can be equally useful to the practicing engineer, or 
as a self-study tool. 

The emphasis of the book is on digital communication systems, which are treated 
in detail in Chapters 7 through 15. These systems are the backbone of modern commu- 
nication systems, including new generations of wireless communication systems, satellite 

„communications, and data transmission networks. Traditional analog communication sys- 
tems are also covered in due detail in Chapters 3, 4, and 6. In addition, the book provides 
detailed coverage of the background required for the course in two chapters, one on linear 
system analysis with emphasis on the frequency-domain approach and Fourier techniques, 
and one on probability, random variables, and random processes. Although these topics are 
now covered in separate courses in the majority of Electrical Engineering programs, it is 
the experience of the authors that the students frequently need to review these topics in a 
course on communications, and therefore it is essential to have quick access to the relevant 
material from thése courses. It is also assumed that the students taking this course have a 
background in calculus, linear algebra, and basic electric circuits. 


NEW TO THIS EDITION 


The following are the major new features in the Second Edition of Fundamentals of Com- 
munication Systems: 


— Major reorganization of basic digital modulation methods based on geometric repre- 
sentation of signals 


— Expanded coverage of carrier phase estimation and symbol synchronization 
— New chapter on multicarrier modulation and OFDM 


New and expanded coverage of iterative decoding of turbo codes and LDPC codes 


New section on multiple antenna (MIMO) systems for radio channels 


New chapter on spread spectrum signals and systems 


ORGANIZATION OF THE BOOK 


The book starts with a brief review of communication systems in Chapter 1, followed by 
methods of signal representation and system analysis in both time and frequency domains 


xvii 


xviii Preface 


in Chapter 2. Emphasis is placed on the Fourier series and the Fourier transform represen- 
tation of signals and the use of transforms in linear systems analysis. 

Chapters 3 and 4 cover the modulation and demodulation of analog signals. In 
Chapter 3, amplitude modulation (AM) is covered. In Chapter 4, frequency modulation 
(FM) and phase modulation (PM) are covered. AM and FM radio broadcasting are also 
treated in these chapters. 

In Chapter 5, we present a review of the basic definitions and concepts in proba- 
bility and random processes. Special emphasis is placed on Gaussian random processes, 
which provide mathematically treatable models for additive noise disturbances. Both time- 
domain and frequency-domain representations of random signals are presented. 

Chapter 6 covers the effects of additive noise in the demodulation of amplitude- 
modulated (AM) and angle-modulated (FM, PM) analog signals and a comparison of these 
analog signal modulations in terms of their signal-to-noise ratio performance. We also 
present the characterization of thermal noise and the effect of transmission losses in analog 
communication systems. 

Chapter 7 is devoted to analog-to-digital conversion. The sampling theorem and 
quantization techniques are treated first, followed by waveform encoding methods includ- 
ing PCM, DPCM, and DM. This chapter concludes with brief discussion of LPC speech 
decoding and the JPEG standard for image compression. 

Chapter 8 treats basic digital modulation methods and their performance in AWGN 
channels. The methods described are binary antipodal and orthogonal signals, and M-ary 
pulse amplitude modulation (PAM), phase-shift keying (PSK), and quadrature amplitude 
modulation (QAM). These types of digital signals are characterized in terms of their geo- 
metric representation. The optimum demodulation of these signals is derived based on 
the maximum a posteriori and maximum-likelihood criteria. In addition, we also describe 
methods for carrier phase estimation using a phase-locked loop (PLL), and symbol syn- 
chronization. 

In Chapter 9, we treat multidimensional digital modulation signals based on a geo- 
metric representation of such signals and derive their performance when transmitted in 
an AWGN channel. Signal types considered include orthogonal signals, biorthogonal sig- 
nals, simplex signals, binary-coded signals, and frequency-shift keying (FSK). Continuous- 
phase FSK (CPFSK) and its spectral characteristics are also treated. 

In Chapter 10, we consider the transmission of digital signals in bandlimited AWGN 
channels. The effect of channel distortion on the transmitted signals is shown to result 
in intersymbol interference (ISI). Then, the design of signals that eliminate or control the 
effect of ISI is described. Finally, we treat the design of adaptive equalizers for suppressing 
ISI in the channel distorted received signal. 

The focus of Chapter 11 is on digital signal transmission via multicarrier modula- 
tion and orthogonal frequency-division multiplexing (OFDM). The implementation of the 
OFDM modulator and demodulator, based on the use of the FFT algorithm, is described. 
Additional topics treated include the spectral characteristics of OFDM signals and methods 
for reducing the peak-to-average power ratio (PAR) in OFDM signals. Finally, we present 
several applications of OFDM in current digital communication systems. 


Preface xix 


In Chapter 12, we present the basic limits on communication of information, includ- 
ing the information content of memoryless sources, efficient coding of the source out- 
put, and the capacity of the AWGN channel. Two widely used algorithms for encoding 
the output of digital sources, namely, the Huffman coding algorithm and the Lempel—Ziv 
algorithm, are also described. 

In Chapter 13, we treat channel coding and decoding. Linear block codes and con- 
volutional codes are described for enhancing the performance of a digital communication 
system in the presence of AWGN. Both hard-decision and soft-decision decoding of block 
and convolutional codes are treated. Coding for bandwidth-limited channels (trellis-coded 
modulation), turbo codes, and low-density parity check codes are also treated. 

In Chapter 14, we treat the characterization of physical wireless channels and the 
construction of mathematical models for time-varying, fading multipath channels. The per- 
formance of binary modulation in Rayleigh fading channels is determined and the benefits 
of signal diversity for combating signal fading is demonstrated. The RAKE demodulator is 
‘described and its performance on frequency selective channels is evaluated. Also treated in 
this chapter is the use of multiple transmit and receive antennas for increasing the transmis- 
sion rate and obtaining signal diversity in wireless communication systems. Methods for 
mapping digital signals for transmission on multiple antennas are also presented, including 
block coding methods such as the Alamouti code and trellis codes. The final topic treated 
in this chapter is link budget analysis for radio channels. 

The final chapter of this book introduces the reader to spread-spectrum digital com- 
munication techniques and their use in combating interference, both intentional (jamming) 
and unintentional, the latter arising from other users of the channel. In particular, we treat 
direct sequence (DS) spread spectrum and frequency-hopping (FH) spread spectrum, and 
their performance characteristics in the presence of interference. Also treated is the gener- 
ation of pseudo-noise (PN) sequences for use in spreading the spectrum of the transmitted 
signal. The final topic of this chapter describes the use of spread spectrum signals in digital 
cellular communication systems, including 2nd-, 3rd-, and 4th-generation (2G, 3G, 4G) 
cellular systems. 

Throughout the book many worked examples are provided to emphasize the use of 
the techniques developed in theory. Following each chapter are a large number of prob- 
lems at different levels of difficulty. The problems are followed by a selection of com- 
puter problems, which usually ask for simulation of various algorithms developed in that 
chapter using MATLAB. The solutions to the MATLAB problems are made available at 
www.pearsonhighered.com. 
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Introduction 





¿Every day, in our work and in our leisure time, we use and come in contact with a variety 
of modern communication systems and communication media, the most common being 
the telephone, radio, and television. Through these media, we are able to communicate 
(nearly) instantaneously with people on different continents, transact our daily business, 
and receive information about various developments and noteworthy events that occur all 
around the world. Electronic mail and facsimile transmission have made it possible to 
rapidly communicate written messages across great distances. 

Can you imagine a world without telephones, radios, and televisions? Yet, when you 
think about it, most of these modern communication systems were invented and developed 
during the past century. Here, we present a brief historical review of major developments 
within the last 200 yearsthat have had a major role in the development of modern commu- 
nication systems. ` 


1.1 HISTORICAL REVIEW 


Telegraphy and Telephony. One of the earliest inventions of major significance 
to communications was the invention of the electric battery by Alessandro Volta in 1799. 
This invention made it possible for Samuel Morse to develop the electric telegraph, which 
he demonstrated in 1837. The first telegraph line linked Washington with Baltimore and 
became operational in May 1844. Morse devised the variable-length binary code given in 
Table 1.1, in which letters of the English alphabet were represented by a sequence of dots 
and dashes (code words). In this code, more frequently occurring letters are represented 
by short code words, while less frequently occurring letters are represented by longer code 
words. 

The Morse code was the precursor to the variable-length source coding method, 
which is described in Chapter 12. It is remarkable that the earliest form of electrical com- 
munications that was developed by Morse, namely, telegraphy, was a binary digital com- 
munication system in which the letters of the English alphabet were efficiently encoded 
into corresponding variable-length code words with binary elements. 
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TABLE 1.1 THE MORSE CODE 

A — N — 

B — O ——— 

C —— P —— 

D — Q — —:— 1 -——— — 

E R za De meie 

F — S 3 2 

G —— Ca Ae Sanit aas 

H U — ETER 

I vV — Oo eee 

i) tae w .—— a 

K —-— Xe el OP ea aro 

L p Y —-—— 9 ————. 

M —— Z -n OSS Saas 

(a) Letters (b) Numbers 

Period(.) sw ee — Wait sign (AS) fee sts 
Comma () 0 ee ee Double dash (break) —...— 


Interrogation (?) 
Quotation Mark (") cij pa 
Colon (:) sipa aea 


Error sign 
Fraction bar (/) EEE 
End of message (AR) 


Semicolon (;) ma hee oe End of transmission (SK) 


Parenthesis ( ) 





(c) Punctuation and special characters 





Nearly 40 years later, in 1875, Emile Baudot developed a code for telegraphy in 
which each letter was encoded into fixed-length binary code words of length 5. In the 
Baudot code, the binary code elements have equal length and are designated as mark and 
space. 

An important milestone in telegraphy was the installation of the first transatlantic 
cable that linked the United States and Europe in 1858. This cable failed after about four 
weeks of operation. A second cable was laid a few years later and became operational in 
July 1866. 

Telephony came into being with the invention of the telephone in the 1870s. Alexan- 
der Graham Bell patented his invention of the telephone in 1876; in 1877, established the 
Bell Telephone Company. Early versions of telephone communication systems were rela- 
tively simple and provided service over several hundred miles. Significant advances in the 
quality and range of service during the first two decades of the twentieth century resulted 
from the invention of the carbon microphone and the induction coil. 


Section 1.1 Historical Review 3 


In 1906, the invention of the triode amplifier by Lee DeForest made it possible to 
introduce signal amplification in telephone communication systems and, thus, to allow for 
telephone signal transmission over great distances. For example, transcontinental telephone 
transmission became operational in 1915. 

The two world wars and the Great Depression during the 1930s must have been a 
deterrent to the establishment of transatlantic telephone service. It was not until 1953, when 
the first transatlantic cable was laid, that telephone service became available between the 
United States and Europe. 

Automatic switching was another important advance in the development of tele- 
phony. The first automatic switch, developed by Strowger in 1897, was an electrome- 
chanical step-by-step switch. This type of switch was used for several decades. With the 
invention of the transistor, electronic (digital) switching became economically feasible. 
After several years of development at the Bell Telephone Laboratories, a digital switch 

„Was placed in service in Illinois in June 1960. 

During the past 50 years, there have been significant advances in telephone commu- 
nications. Fiber optic cables are rapidly replacing copper wire in the telephone plant, and 
electronic switches have replaced the old electromechanical systems. 


Wireless Communications. The development of wirelesscommunications stems 
from the works of Oersted, Faraday, Gauss, Maxwell, and Hertz during the nineteenth cen- 
tury. In 1820, Oersted demonstrated that an electric current produces a magnetic field. On 
August 29, 1831; Michael Faraday showed that an induced current is produced by moving 
a magnet in the vicinity of a conductor. Thus, he demonstrated that a changing magnetic 
field produces an electric field. With this early work as background, James C. Maxwell in 
1864 predicted the existence of electromagnetic radiation and formulated the basic theory 
that has been in use for over a century. Maxwell’s theory was verified experimentally by 
Hertz in 1887. 

In 1894, a sensitive device that could detect radio signals, called the coherer, was 
used by its inventor, Oliver Lodge, to demonstrate wireless communication over a distance 
of 150 yards in Oxford, England. Guglialmo Marconi is credited with the development of 
wireless telegraphy. In 1895, Marconi demonstrated the transmission of radio signals at a 
distance of approximately 2 km. Two years later, in 1897, he patented a radio telegraph 
system and established the Wireless Telegraph and Signal Company. On December 12, 
1901, Marconi received a radio signal at Signal Hill in Newfoundland; this signal was 
transmitted from Cornwall, England, a city located about 1700 miles away. 

The invention of the vacuum tube was especially instrumental in the development 
of radio communication systems. The vacuum diode was invented by Fleming in 1904, 
and the vacuum triode amplifier was invented by DeForest in 1906, as previously indi- 
cated. In the early part of the twentieth century, the invention of the triode made radio 
broadcast possible. The AM (amplitude modulation) broadcast was initiated in 1920 when 
the radio station KDKA, Pittsburgh, went on the air. From that date, AM radio broad- 
casting grew very rapidly across the country and around the world. The superheterodyne 
AM radio receiver, as we know it today, was invented by Edwin Armstrong during World 
War I. Another significant development in radio communications was the invention of FM 
(frequency modulation), also by Armstrong. In 1933, Armstrong built and demonstrated 


4 Introduction Chapter 1 


the first FM communication system. However, the use of FM was developed more slowly 
than the use of AM broadcast. It was not until the end of World War II that FM broadcast 
gained in popularity and developed commercially. 

The first television system was built in the United States by V. K. Zworykin and 
demonstrated in 1929. Commercial television broadcasting was initiated in London in 1936 
by the British Broadcasting Corporation (BBC). Five years later, the Federal Communica- 
tions Commission (FCC) authorized television broadcasting in the United States. 


The Past 60 Years. The growth in communication services over the past 60 
years has been phenomenal. Significant achievements include the invention of the tran- 
sistor in 1947 by Walter Brattain, John Bardeen, and William Shockley; the integrated 
circuit in 1958 by Jack Kilby and Robert Noyce; and the laser in 1958 by Townes and 
Schawlow. These inventions have made possible the development of small-size, low-power, 
low-weight, and high-speed electronic circuits that are used in the construction of satellite 
communication systems, wideband microwave radio systems, cellular communication sys- 
tems, and light-wave communication systems using fiber optic cables. A satellite named 
Telstar I was launched in 1962 and used to relay TV signals between Europe and the United 
States. Commercial satellite communication services began in 1965 with the launching of 
the Early Bird satellite. 

Currently, most of the wireline communication systems are being replaced by fiber 
optic cables, which provide extremely high bandwidth and make possible the transmis- 
sion of a wide variety of information sources, including voice, data, and video. Cellular 
radio has been developed to provide telephone service to people in automobiles, buses, and 
trains. High-speed communication networks link computers and a variety of peripheral 
devices, literally around the world. 

Today, we are witnessing a significant growth in the introduction and use of personal 
communication services, including voice, data, and video transmission. Satellite and fiber 
optic networks provide high-speed communication services around the world. Indeed, this 
is the dawn of the modern telecommunications era. 

There are several historical treatments in the development of radio and telecommuni- 
cations covering the past century. We cite the books by McMahon, entitled The Making of 
a Profession—A Century of Electrical Engineering in America (IEEE Press, 1984); Ryder 
and Fink, entitled Engineers and Electronics (IEEE Press, 1984); and S. Millman, Edi- 
tor, entitled A History of Engineering and Science in the Bell System—Communications 
Sciences (1925-1980) (AT&T Bell Laboratories, 1984). 


1.2 ELEMENTS OF AN ELECTRICAL COMMUNICATION SYSTEM 


Electrical communication systems are designed to send messages or information from a 
source that generates the messages to one or more destinations. In general, a communica- 
tion system can be represented by the functional block diagram shown in Figure 1.1. The 
information generated by the source may be of the form of voice (speech source), a picture 
(image source), or plain text in some particular language, such as English, Japanese, Ger- 
man, and French. An essential feature of any source that generates information is that its 
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Figure 1.1 Functional diagram of a communication system. 
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output is described in probabilistic terms, i.e., the output of a source is not deterministic. 
Otherwise, there would be no need to transmit the message. 

A transducer is usually required to convert the output of a source into an electrical 
signal that is suitable for transmission. For example, a microphone serves as the trans- 
ducer that converts an acoustic speech signal into an electrical signal, and a video camera 
that converts an image into an electrical signal. At the destination, a similar transducer is 
required to convert the electrical signals that are received into a form that is suitable for the 
user, e.g., acoustic signals and images. 

The heart of the communication system consists of three basic parts, namely, the 
transmitter, the channel ‘and the receiver. The functions performed by these three elements 
are described next. 


The Transmitter. The transmitter converts the electrical signal into a form that 
is suitable for transmission through the physical channel or transmission medium. For 
example, in radio and TV broadcasts, the FCC specifies the frequency range for each 
transmitting station. Hence, the transmitter must translate the outgoing information sig- 
nal into the appropriate frequency range that matches the frequency allocation assigned to 
the transmitter. Thus, signals transmitted by multiple radio stations do not interfere with 
one another. Similar functions are performed in telephone communication systems where 
the electrical speech signals from many users are transmitted over the same wire. 

In general, the transmitter matches the message signal to the channel via a process 
called modulation. Usually, modulation involves the use of the information signal to sys- 
tematically vary either the amplitude or the frequency or the phase of a sinusoidal carrier. 
For example, in AM radio broadcast, the information signal that is transmitted is contained 
in the amplitude variations of the sinusoidal carrier, which is the center frequency in the 
frequency band allocated to the radio transmitting station. This is an example of amplitude 
modulation. In an FM radio broadcast, the information signal that is transmitted is con- 
tained in the frequency variations of the sinusoidal carrier. This is an example of frequency 
modulation. Phase modulation (PM) is yet a third method for impressing the information 
signal on a sinusoidal carrier. 
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In general, carrier modulation such as AM, FM, and PM is performed at the trans- 
mitter, as previously indicated, to convert the information signal to a form that matches 
the characteristics of the channel. Thus, through the process of modulation, the informa- 
tion signal is translated in frequency to match the allocation of the channel. The choice 
of the type of modulation is based on several factors, such as the amount of bandwidth 
allocated, the types of noise and interference that the signal encounters in transmission 
over the channel, and the electronic devices that are available for signal amplification 
prior to transmission. In any case, the modulation process makes it possible to accom- 
modate the transmission of multiple messages from many users over the same physical 
channel. 

In addition to modulation, other functions that are usually performed at the trans- 
mitter are filtering of the information-bearing signal, amplification of the modulated signal 
and, in the case of wireless transmission, radiation of the signal by means of a transmitting 
antenna. 


The Channel. The communication channel is the physical medium that is used 
to send the signal from the transmitter to the receiver. In wireless transmission, the chan- 
nel is usually the atmosphere (free space). On the other hand, telephone channels usually 
employ a variety of physical media, including wirelines, fiber optic cables, and wireless 
(microwave radio). Whatever the physical medium for signal transmission, the essential 
feature is that the transmitted signal is corrupted in a random manner by a variety of possi- 
ble mechanisms. The most common form of signal degradation comes in the form of addi- 
tive noise, which is generated at the front end of the receiver, where signal amplification 
is performed. This noise is often called thermal noise. In wireless transmission, additional 
additive disturbances are man-made noise and atmospheric noise picked up by a receiv- 
ing antenna. Automobile ignition noise is an example of man-made noise, and electrical 
lightning discharges from thunderstorms is an example of atmospheric noise. Interference 
from other users of the channel is another form of additive noise that often arises in both 
wireless and wireline communication systems. 

In some radio communication channels, such as the ionospheric channel that is used 
for long-range, short-wave radio transmission, another form of signal degradation is mul- 
tipath propagation. Such signal distortion is characterized as a nonadditive signal distur- 
bance that manifests itself as time variations in the signal amplitude, usually called fading. 
This phenomenon is described in more detail in Section 1.3. 

Both additive and nonadditive signal distortions are usually characterized as random 
phenomena and described in statistical terms. The effect of these signal distortions must be 
considered in the design of the communication system. 

In the design of a communication system, the system designer works with mathe- 
matical models that statistically characterize the signal distortion encountered on physical 
channels. Often, the statistical description that is used in a mathematical model is a result 
of actual empirical measurements obtained from experiments involving signal transmis- 
sion over such channels. In such cases, there is a physical justification for the mathematical 
model used in the design of communication systems. On the other hand, in some commu- 
nication system designs, the statistical characteristics of the channel may vary significantly 
with time. In such cases, the system designer may design a communication system that is 
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robust to the variety of signal distortions. This can be accomplished by having the system 
adapt some of its parameters to the channel distortion encountered. 


The Receiver. The function of the receiver is to recover the message signal con- 
tained in the received signal. If the message signal is transmitted by carrier modulation, 
the receiver performs carrier demodulation to extract the message from the sinusoidal 
carrier. Since the signal demodulation is performed in the presence of additive noise and 
possibly other signal distortions, the demodulated message signal is generally degraded to 
some extent by the presence of these distortions in the received signal. As we shall see, 
the fidelity of the received message signal is a function of the type of modulation and the 
strength of the additive noise. 

Besides performing the primary function of signal demodulation, the receiver also 
performs a number of peripheral functions, including signal filtering and noise suppression. 


1.2.1 Digital Communication System 


Up to this point, we have described an electrical communication system in rather broad 
terms based on the implicit assumption that the message signal is a continuous time-varying 
waveform. We refer to such continuous-time signal waveforms as analog signals and to the 
corresponding information sources that produce such signals as analog sources. Analog 
signals can be transmitted directly over the communication channel via carrier modulation 
and demodulated accordingly at the receiver. We call such a communication system an 
analog communication system. 

Alternatively, an analog source output may be converted into a digital form, and the 
message can be transmitted via digital modulation and demodulated as a digital signal 
at the receiver. There are some potential advantages to transmitting an analog signal by 
means of digital modulation. The most important reason is that signal fidelity is better 
controlled through digital transmission than through analog transmission. In particular, 
digital transmission allows us to regenerate the digital signal in long-distance transmission, 
thus eliminating effects of noise at each regeneration point. In contrast, the noise added in 
analog transmission is amplified along with the signal when amplifiers are periodically 
used to boost the signal level in long-distance transmission. Another reason for choosing 
digital transmission over analog is that the analog message signal may be highly redundant. 
With digital processing, redundancy may be removed prior to modulation, thus conserving 
channel bandwidth. Yet a third reason may be that digital communication systems are often 
cheaper to implement. 

In some applications, the information to be transmitted is inherently digital, e.g., in 
the form of English text and computer data. In such cases, the information source that 
generates the data is called a discrete (digital) source. 

In a digital communication system, the functional operations performed at the trans- 
mitter and receiver must be expanded to include message signal discretization at the trans- 
mitter and message signal synthesis or interpolation at the receiver. Additional functions 
include redundancy removal, as well as channel coding and decoding. 

Figure 1.2 illustrates the functional diagram and the basic elements of a digital com- 
munication system. The source output may be either an analog signal, such as an audio 
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Figure 1.2 Basic elements of a digital communication system. 



































or video signal, or a digital signal, such as computer output, which is discrete in time and 
has a finite number of output characters. In a digital communication system, the messages 
produced by the source are usually converted into a sequence of binary digits. Ideally, we 
would like to represent the source output (message) with as few binary digits as possible. In 
other words, we seek an efficient representation of the source output that results in little or 
no redundancy. The process of efficiently converting the output of either an analog or a dig- 
ital source into a sequence of binary digits is called source encoding or data compression. 
We shall describe source-encoding methods in Chapter 12. 

The source encoder outputs a sequence of binary digits, which we call the informa- 
tion sequence; this is passed to the channel encoder. The purpose of the channel encoder is 
to introduce, in a controlled manner, some redundancy in the binary information sequence 
that can be used at the receiver to overcome the effects of noise and interference encoun- 
tered in the transmission of the signal through the channel. Thus, the added redundancy 
serves to increase the reliability of the received data and improves the fidelity of the 
received signal. In effect, redundancy in the information sequence aids the receiver in 
decoding the desired information sequence. For example, a (trivial) form of encoding of 
the binary information sequence is simply to repeat each binary digit m times, where m is 
some positive integer. More sophisticated (nontrivial) encoding involves taking k informa- 
tion bits at a time and mapping each k-bit sequence into a unique n-bit sequence, called 
a code word. The amount of redundancy introduced by encoding the data in. this man- 
ner is measured by the ratio n/k. The reciprocal of this ratio, namely, k/n, is called the 
rate of the code or, simply, the code rate. Channel coding and decoding are discussed in 
Chapter 13. 

The binary sequence at the output of the channel encoder is passed to the digital 
modulator, which serves as the interface to the communication channel. Since nearly all of 
the communication channels encountered in practice are capable of transmitting electrical 
signals (waveforms), the primary purpose of the digital modulator is to map the binary 
information sequence into signal waveforms. To elaborate on this point, let us suppose that 
the coded information sequence is to be transmitted one bit at a time at some uniform rate 
R bits/sec. The digital modulator may simply map the binary digit 0 into a waveform so(t) 
and the binary digit 1 into a waveform s(t). In this manner, each bit from the channel 
encoder is transmitted separately. We call this binary modulation. Alternatively, the modu- 
lator may transmit b coded information bits at a time by using M = 2* distinct waveforms 
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s;(t),i =0,1,..., M — 1. This provides one waveform for each of the 2% possible k-bit 
sequences. We call this M-ary modulation (M > 2). Note that a new k-bit sequence enters 
the modulator every k/R seconds. Hence, when the channel bit rate R is fixed, the amount 
of time available to transmit one of the M waveforms corresponding to a k-bit sequence is 
k times the time period in a system that uses binary modulation. 

At the receiving end of a digital communication system, the digital demodulator pro- 
cesses the channel-corrupted transmitted waveform and reduces each waveform to a sin- 
gle number that represents an estimate of the transmitted data symbol (binary or M-ary). 
For example, when binary modulation is used, the demodulator may process the received 
waveform and decide whether the transmitted bit is a O or a 1. In such a case, we say the 
demodulator has made a binary decision or hard decision. As an alternative, the demod- 
ulator may make a ternary decision; i.e., it decides that the transmitted bit is either a 0 or 
1 or it makes no decision at all, depending on the apparent quality of the received signal. 

„When no decision is made on a particular bit, we say that the demodulator has inserted an 

erasure in the demodulated data. Using the redundancy in the transmitted data, the decoder 
attempts to fill in the positions where erasures occurred. Viewing the decision process per- 
formed by the demodulator as a form of quantization, we observe that binary and ternary 
decisions are special cases of a demodulator that quantizes to Q levels, where Q > 2. In 
general, if the digital communication system employs M-ary modulation, where M rep- 
resents the M possible transmitted symbols, each corresponding to k = log, M bits, the 
demodulator may make a Q-ary decision, where Q > M. In the extreme case where no 
quantization is performed, Q = oo. 

When there is no redundancy in the transmitted information, the demodulator must 
decide which of the M -waveforms was transmitted in any given time interval. Conse- 
quently Q = M, and since there is no redundancy in the transmitted information, no 
discrete channel decoder is used following the demodulator. On the other hand, when there 
is redundancy introduced by a discrete channel encoder at the transmitter, the Q-ary output 
from the demodulator occurring every k/R seconds is fed to the decoder, which attempts 
to reconstruct the original information sequence from knowledge of the code used by the 
channel encoder and the redundancy contained in the received data. 

A measure of how well the demodulator and decoder perform is the frequency with 
which errors occur in the decoded sequence. More precisely, the average probability of a 
bit error at the output of the decoder is a measure of the performance of the demodulator— 
decoder combination. In general, the probability of error is a function of the code char- 
acteristics, the types of waveforms used to transmit the information over the channel, the 
transmitter power, the characteristics of the channel, i.e., the amount of noise, and the 
method of demodulation and decoding. 

These items and their effect on performance will be discussed in detail in Chapters 8 
through 10. 

As a final step, when an analog output is desired, the source decoder accepts the 
output sequence from the channel decoder and, from knowledge of the source encoding 
method used, attempts to reconstruct the original signal from the source. Due to channel 
decoding errors and possible distortion introduced by the source encoder and, perhaps, 
the source decoder, the signal at the output of the source decoder is an approximation to 
the original source output. The difference or some function of the difference between the 
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original signal and the reconstructed signal is a measure of the distortion introduced by the 
digital communication system. 


1.2.2 Early Work in Digital Communications 


Although Morse is responsible for the development of the first electrical digital commu- 
nication system (telegraphy), the beginnings of what we now regard as modern digital 
communications stem from the work of Nyquist (1924), who investigated the problem of 
determining the maximum signaling rate that can be used over a telegraph channel of a 
given bandwidth without intersymbol interference. He formulated a model of a telegraph 
system in which a transmitted signal has the general form 


s(t)= > ang(t — nT), 


n 


where g(t) represents a basic pulse shape and {a,} is the binary data sequence of {+1} 
transmitted at a rate of 1/T bits/sec. Nyquist set out to determine the optimum pulse shape 
that was bandlimited to W Hz and maximized the bit rate 1/T under the constraint that the 
pulse caused no intersymbol interference at the sampling times k/T,k = 0, +1, +2,.... 
His studies led him to conclude that the maximum pulse rate 1/T is 2W pulses/sec. This 
rate is now called the Nyquist rate. Moreover, this pulse rate can be achieved by using the 
pulses g(t) = (sin2mWrt) /2nWt. This pulse shape allows the recovery of the data without 
intersymbol interference at the sampling instants. Nyquist’s result is equivalent to a ver- 
sion of the sampling theorem for bandlimited signals, which was later stated precisely by 
Shannon (1948). The sampling theorem states that a signal of bandwidth W can be recon- 
structed from samples taken at the Nyquist rate of 2W samples/sec using the interpolation 
formula 


ie 3 2 (A sin2nW (t Tn] 2w) 
: 2W 2nW (t —n/2W) 

In light of Nyquist’s work, Hartley (1928) considered the amount of data that can be 
reliably transmitted over a bandlimited channel when multiple amplitude levels are used. 
Due to the presence of noise and other interference, Hartley postulated that the receiver can 
reliably estimate the received signal amplitude to some accuracy, say As. This investigation 
led Hartley to conclude that there is a maximum data rate that can be communicated reli- 
ably over a bandlimited channel, when the maximum signal amplitude is limited to A max 
(fixed power constraint) and the amplitude resolution is As. 

Another significant advance in the development of communications was the work 
of Wiener (1942), who considered the problem of estimating a desired signal waveform 
s(t) in the presence of additive noise n(t) based on observation of the received signal 
r(t) = s(t) + n(t). This problem arises in signal demodulation. Wiener determined the 
linear filter whose output is the best mean-square approximation to the desired signal s(t). 
The resulting filter is called the optimum linear (Wiener) filter. 

Hartley’s and Nyquist’s results on the maximum transmission rate of digital informa- 
tion were precursors to the work of Shannon (1948a,b), who established the mathematical 
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foundations for information theory and derived the fundamental limits for digital commu- 
nication systems. In his pioneering work, Shannon formulated the basic problem of reliable 
transmission of information in statistical terms, using probabilistic models for information 
sources and communication channels. Based on this statistical formulation, he adopted a 
logarithmic measure for the information content of a source. He also demonstrated that 
the effect of a transmitter power constraint, a bandwidth constraint, and additive noise can 
be associated with the channel and incorporated into a single parameter, called the chan- 
nel capacity. For example, in the case of an additive white (spectrally flat) Gaussian noise 
interference, an ideal bandlimited channel of bandwidth W has a capacity C given by 





P 
C = W log, (1 + WN ) bits/sec, 


0 


where P is the average transmitted power and Nọ is the power-spectral density of the addi- 
tive noise. The significance of the channel capacity is as follows: If the information rate R 
from the source is less than C (R < C), then it is theoretically possible to achieve reliable 
(error-free) transmission through the channel by appropriate coding. On the other hand, if 
R > C, reliable transmission is not possible regardless of the amount of signal process- 
ing performed at the transmitter and receiver. Thus, Shannon established basic limits on 
communication of information and gave birth to a new field that is now called information 
theory. č 

Initially, the fundamental work of Shannon had a relatively small impact on the 
design and development,of new digital communication systems. In part, this was due to 
the small demand for digital information transmission during the 1950s. Another reason 
was the relatively large complexity and, hence, the high-cost digital hardware required to 
achieve the high efficiency and the high reliability predicted by Shannon’s theory. 

Another important contribution to the field of digital communications is the work 
of Kotelnikov (1947). His work provided a coherent analysis of the various digital com- 
munication systems, based on a geometrical approach. Kotelnikov’s approach was later 
expanded by Wozencraft and Jacobs (1965). 

The increase in the demand for data transmission during the last four decades, cou- 
pled with the development of more sophisticated integrated circuits, has led to the devel- 
opment of very efficient and more reliable digital communication systems. In the course of 
these developments, Shannon’s original results and the generalization of his results on max- 
imum transmission limits over a channel and on bounds on the performance achieved have 
served as benchmarks against which any given communication system design can be com- 
pared. The theoretical limits, derived by Shannon and other researchers that contributed to 
the development of information theory, serve as an ultimate goal in the continuing efforts 
to design and develop more efficient digital communication systems. 

Following Shannon’s publications came the classic work of Hamming (1950), which 
used error-detecting and error-correcting codes to combat the detrimental effects of channel 
noise. Hamming’s work stimulated many researchers in the years that followed, and a 
variety of new and powerful codes were discovered, many of which are used today in the 
implementation of modern communication systems. 


12 Introduction Chapter 1 
1.3 COMMUNICATION CHANNELS AND THEIR CHARACTERISTICS 


As indicated in our preceding discussion, the communication channel provides the con- 
nection between the transmitter and the receiver. The physical channel may be a pair of 
wires that carry the electrical signal, or an optical fiber that carries the information on a 
modulated light beam, or an underwater ocean channel in which the information is trans- 
mitted acoustically, or free space over which the information-bearing signal is radiated by 
use of an antenna. Other media that can be characterized as communication channels are 
data storage media, such as magnetic tape, magnetic disks, and optical disks. 

One common problem in signal transmission through any channel is additive noise. 
In general, additive noise is generated internally by components, such as resistors and 
solid-state devices, used to implement system. This type of noise is usually called thermal 
noise. Other sources of noise and interference may arise externally to the system, such as 
interference from other users of the channel. When such noise and interference occupy the 
same frequency band as the desired signal, the effect can be minimized by proper design of 
the transmitted signal and the demodulator at the receiver. Other types of signal degradation 
may be encountered in transmission over the channel, such as signal attenuation, amplitude 
and phase distortion, and multipath distortion. 

The effects of noise may be minimized by increasing the power in the transmitted 
signal. However, equipment and other practical constraints limit the power level in the 
transmitted signal. Another basic limitation is the available channel bandwidth. A band- 
width constraint is usually due to the physical limitations of the medium and the electronic 
components used to implement the transmitter and the receiver. These two limitations result 
in constraining the amount of data that can be transmitted reliably over any communication 
channel. 

Next, we describe some of the important characteristics of several communication 
channels. 


Wireline Channels. The telephone network makes extensive use of wirelines for 
voice signal transmission, as well as data and video transmission. Twisted-pair wirelines 
and coaxial cable are basically guided electromagnetic channels that provide relatively 
modest bandwidths. Telephone wire generally used to connect a customer to a central office 
has a bandwidth of several hundred kilohertz (kHz). On the other hand, coaxial cable has 
a usable bandwidth of several megahertz (MHz). Figure 1.3 illustrates the frequency range 
of guided electromagnetic channels, which include waveguides and optical fibers. 

Signals transmitted through such channels are distorted in both amplitude and phase, 
and they are further corrupted by additive noise. Twisted-pair wireline channels are also 
prone to crosstalk interference from physically adjacent channels. Because wireline chan- 
nels carry a large percentage of our daily communications around the country and the 
world, much research has been performed on the characterization of their transmission 
properties and on methods for mitigating the amplitude and phase distortion encountered 
in signal transmission. In Chapter 10, we describe methods for designing optimum trans- 
mitted signals and their demodulation, including the design of channel equalizers that com- 
pensate for amplitude and phase distortion. 
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Fiber Optic Channels. Optical fibers offer the communication system designer 
a channel bandwidth that is several orders of magnitude larger than coaxial cable chan- 
nels. During the past decade, researchers have developed optical fiber cables, which have 
a relatively low signal attenuation, and highly reliable photonic devices, which improve 
signal generation and signal detection. These technological advances have resulted in a 
rapid deployment of fiber optic channels both in domestic telecommunication systems 
as well as for transatlantic and transpacific communications. With the large bandwidth 
available on fiber optic channels, it is possible for the telephone companies to offer sub- 
scribers a wide array of telecommunication services, including voice, data, facsimile, 
and video. 
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The transmitter or modulator in a fiber-optic communication system is a light source, 
either a light-emitting diode (LED) or a laser. Information is transmitted by varying (mod- 
ulating) the intensity of the light source with the message signal. The light propagates 
through the fiber as a light wave and is amplified periodically (in the case of digital trans- 
mission, it is detected and regenerated by repeaters) along the transmission path to compen- 
sate for signal attenuation. At the receiver, the light intensity is detected by a photodiode, 
whose output is an electrical signal that varies in direct proportion to the power of the light 
impinging on the photodiode. 

It is envisioned that fiber optic channels will replace nearly all wireline channels in 
the telephone network in the next few years. 


Wireless Electromagnetic Channels. In radio communication systems, elec- 
tromagnetic energy is coupled to the propagation medium by an antenna, which serves 
as the radiator. The physical size and the configuration of the antenna depend primar- 
ily on the frequency of operation. To obtain efficient radiation of electromagnetic energy, 
the antenna must be longer than 1/10 of the wavelength. Consequently, a radio station 
transmitting in the AM frequency band, say, at 1 MHz (corresponding to a wavelength of 
A = c/ fe = 300 m) requires an antenna of at least 30 meters. 

Figure 1.4 illustrates the various frequency bands of the electromagnetic spectrum. 
The mode of propagation of electromagnetic waves in the atmosphere and in free space 
may be subdivided into three categories, namely, ground-wave propagation, sky-wave 
propagation, and line-of-sight (LOS) propagation. In the very low frequency (VLF) and 
extremely low frequency bands where the wavelengths exceed 10 kilometers, the earth 
and the ionosphere act as a waveguide for electromagnetic wave propagation. In these fre- 
quency ranges, communication signals practically propagate around the globe. For this 
reason, these frequency bands are primarily used to provide navigational aids from shore 
to ships around the world. The channel bandwidths available in these frequency bands are 
relatively small (usually 1%—10% of the center frequency); hence, the information that is 
transmitted through these channels is relatively of slow speed and generally confined to 
digital transmission. A dominant type of noise at these frequencies is generated from thun- 
derstorm activity around the globe, especially in tropical regions. Interference results from 
the many users of these frequency bands. 

Ground-wave propagation, illustrated in Figure 1.5, is the dominant mode of prop- 
agation for frequencies in the medium frequency (MF) band (0.3-3 MHz). This is the 
frequency band used for AM broadcasting and maritime radio broadcasting. In AM broad- 
cast, ground-wave propagation limits the range of even the most powerful radio stations to 
about 100 miles. Atmospheric noise, man-made noise, and thermal noise from electronic 
components at the receiver are dominant disturbances for signal transmission at MF. 

Sky-wave propagation, as illustrated in Figure 1.6, results from transmitted signals 
being reflected (bent or refracted) from the ionosphere, which consists of several layers 
of charged particles ranging in altitude from 30 to 250 miles above the surface of the 
earth. During the daytime hours, the heating of the lower atmosphere by the sun causes the 
formation of the lower layers at altitudes below 75 miles. These lower layers, especially the 
D-layer, absorb frequencies below 2 MHz; thus, they severely limit sky-wave propagation 
of AM radio broadcast. However, during the nighttime hours, the electron density in the 
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lower layers of the ionosphere drops sharply and the frequency absorption that occurs 
during the day is significantly reduced. As a consequence, powerful AM radio broadcast 
stations can propagate over large distances via sky wave over the F-layer of the ionosphere, 
which ranges from 90 miles to 250 miles above the surface of the earth. 

A common problem with electromagnetic wave propagation via sky wave in the 
high frequency (HF) range is signal multipath. Signal multipath occurs when the transmit- 
ted signal arrives at the receiver via multiple propagation paths at different delays. Signal 
multipath generally results in intersymbol interference in a digital communication system. 
Moreover, the signal components arriving via different propagation paths may add destruc- 
tively, resulting in a phenomenon called signal fading. Most people have experienced this 
phenomenon when listening to a distant radio station at night, when sky wave is the dom- 
inant propagation mode. Additive noise at HF is a combination of atmospheric noise and 
thermal noise. 

Sky-wave ionospheric propagation ceases to exist at frequencies above approxi- 
mately 30 MHz, which is the end of the HF band. However, it is possible to have iono- 
spheric scatter propagation at frequencies in the range of 30-60 MHz; this is a result of 
signal scattering from the lower ionosphere. It is also possible to communicate over dis- 
tances of several hundred miles using tropospheric scattering at frequencies in the range 
of 40-300 MHz. Troposcatter results from signal scattering due to particles in the atmo- 
sphere at altitudes of 10 miles or less. Generally, ionospheric scatter and tropospheric scat- 
ter involve large signal propagation losses and require a large amount of transmitter power 
and relatively large antennas. 

Frequencies above 30 MHz propagate through the ionosphere with relatively little 
loss and make satellite and extraterrestrial communications possible. Hence, at frequencies 
in the VHF band and higher, the dominant mode of electromagnetic propagation is LOS 
propagation. For terrestrial communication systems, this means that the transmitter and 
receiver antennas must be in direct LOS with relatively little or no obstruction. For this 
reason, television stations transmitting in the very high frequency (VHF) and ultra high 
frequency (UHF) bands mount their antennas on high towers in order to achieve a broad 
coverage area. 

In general, the coverage area for LOS propagation is limited by the curvature of 
the earth. If the transmitting antenna is mounted at a height h feet above the surface of 
the earth, the distance to the radio horizon is approximately d = /2h miles (assuming 
no physical obstructions such as a mountain). For example, a TV antenna mounted on a 
tower of 1000 feet in height provides a coverage of approximately 50 miles. As another 
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example, microwave radio relay systems used extensively for telephone and video trans- 
mission at frequencies above 1 GHz have antennas mounted on tall towers or on the top of 
tall buildings. 

At frequencies above 10 GHz, atmospheric conditions play a major role in signal 
propagation. In particular, heavy rain introduces extremely high propagation losses that can 
result in service outages (total breakdown in the communication system). For example, at 
10 GHz, heavy rain results in a propagation loss of approximately 0.3 dB/km; at 30 GHz, 
the loss in approximately 2 dB/km; at 100 GHz, the loss is approximately 5 dB/km. 

At frequencies above the millimeter wave band, we have the infrared and visible light 
regions of the electromagnetic spectrum, which can be used to provide LOS optical com- 
munication in free space. To date, these frequency bands have been used in experimental 
communication systems, such as satellite-to-satellite links. 


é Underwater Acoustic Channels. Over the past few decades, ocean exploration 
activity has been steadily increasing. Coupled with this increase in ocean exploration is the 
need to transmit data, which is collected by sensors placed underwater, to the surface of the 
ocean. From there, it is possible to relay the data via a satellite to a data collection center. 

Electromagnetic waves do not propagate over long distances underwater, except at 
extremely low frequencies. However, the transmission of signals at such low frequencies is 
prohibitively expensive because of the large and powerful transmitters required. The atten- 
uation of electromagnetic waves in water can be expressed in terms of the skin depth, which 
is the distance a gignal is attenuated by 1/e. For seawater, the skin depth 6 = 250/,/f, 
where f is expressed in Hertz and ô is in meters. For example, at 10 kHz, the skin depth is 
2.5 meters. In contrast, acoustic signals propagate over distances of tens and even hundreds 
of kilometers. j 

A shallow-water acoustic channel is characterized as a multipath channel due to sig- 
nal reflections from the surface and the bottom of the sea. Due to wave motion, the signal 
multipath components undergo time-varying propagation delays that result in signal fading. 
In addition, there is frequency-dependent attenuation, which is approximately proportional 
to the square of the signal frequency. 

Ambient ocean acoustic noise is caused by shrimp, fish, and various mammals. Addi- 
tionally, man-made acoustic noise exists near harbors. 

In spite of this hostile environment, it is possible to design and implement efficient 
and highly reliable underwater acoustic communication systems for transmitting digital 
signals over large distances. 


Storage Channels. Information storage and retrieval systems constitute a signif- 
icant part of our data-handling activities on a daily basis. Magnetic tape (including digital 
audio tape and video tape), magnetic disks (used for storing large amounts of computer 
data), and optical disks (used for computer data storage, music, and video) are examples of 
data storage systems that can be characterized as communication channels. The process of 
storing data on a magnetic tape, magnetic disk, or optical disk is equivalent to transmitting 
a signal over a telephone or a radio channel. The readback process and the signal process- 
ing used to recover the stored information is equivalent to the functions performed by a 
telephone receiver or radio communication system to recover the transmitted information. 
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Additive noise generated by the electronic components and interference from adja- 
cent tracks is generally present in the readback signal of a storage system. 

The amount of data that can be stored is generally limited by the size of the disk or 
tape and the density (number of bits stored per square inch) that can be achieved by the 
write/read electronic systems and heads. For example, a packing density of 10° bits/sq in 
has been achieved in magnetic disk storage systems. The speed at which data can be written 
on a disk or tape and the speed at which it can be read back is also limited by the associated 
mechanical and electrical subsystems that constitute an information storage system. 

Channel coding and modulation are essential components of a well-designed digital 
magnetic or optical storage system. In the readback process, the signal is demodulated and 
the added redundancy introduced by the channel encoder is used to correct errors in the 
readback signal. 


1.4 MATHEMATICAL MODELS FOR COMMUNICATION CHANNELS 


While designing communication systems to transmit information through physical chan- 
nels, we find it convenient to construct mathematical models that reflect the most important 
characteristics of the transmission medium. Then the mathematical model for the channel is 
used in the design of the channel encoder and modulator at the transmitter and the demod- 
ulator and channel decoder at the receiver. Next, we provide a brief description of three 
channel models that are frequently used to characterize many of the physical channels that 
we encounter in practice. 


The Additive Noise Channel. The simplest mathematical model for a commu- 
nication channel is the additive noise channel, illustrated in Figure 1.7. In this model, the 
transmitted signal s(t) is corrupted by the additive random-noise process n(t). Physically, 
the additive noise process may arise from electronic components and amplifiers at the 
receiver of the communication system, or from interference encountered in transmission, 
as in the case of radio signal transmission. 

If the noise is introduced primarily by electronic components and amplifiers at the 
receiver, it may be characterized as thermal noise. This type of noise is characterized sta- 
tistically as a Gaussian noise process. Hence, the resulting mathematical model for the 
channel is usually called the additive Gaussian noise channel. Because this channel model 
applies to a broad class of physical communication channels and because it has mathemat- 
ical tractability, this is the predominant channel model used in the analysis and design 
of communication systems. Channel attenuation is easily incorporated into the model. 
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Figure 1.7 The additive noise channel. 
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When the signal undergoes attenuation in transmission through the channel, the received 
signal is 
r(t) =as(t)+n(t), (1.4.1) 


where a represents the attenuation factor. 


The Linear Filter Channel. In some physical channels, such as wireline tele- 
phone channels, filters are used to ensure that the transmitted signals do not exceed spec- 
ified bandwidth limitations; thus, they do not interfere with one another. Such channels 
are generally characterized mathematically as linear filter channels with additive noise, as 
illustrated in Figure 1.8. Hence, if the channel input is the signal s(t), the channel output 
is the signal 


z r(t) = s(t)*h(t) +n(t) 


= 1 A(t)s(t —t) dt +n(t), (1.4.2) 
0 


where h(t) is the impulse response of the linear filter and * denotes convolution. 


The Linear Time-Variant Filter Channel. Physical channels, such as underwa- 
ter acoustic channels and ionospheric radio channels, which result in time-variant multipath 
propagation of the transmitted signal, may be characterized mathematically as time-variant 
linear filters. Such linear filters are characterized by the time-variant channel impulse 
response f(t; t), where h(t; t) is the response of the channel at time t, due to an impulse 
applied at time t — t. Thus, t represents the “age” (elapsed time) variable. The linear time- 
variant filter channel with additive noise is illustrated in Figure 1.9. For an input signal 
s(t), the channel output signal is 
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r(t) = s(t) «h(t; 1) +n(t) 


lee} 
= / h(t; t)s(t — t) dt +n(t). (1.4.3) 
—00 
Let us consider signal propagation through a multipath channel, such as the iono- 
sphere (at frequencies below 30 MHz) and mobile cellular radio channels. For such chan- 
nels, a good model for the time-variant impulse response has the form 


L 
h(t; t) = X ag(t)8(t — t), (1.4.4) 


k=l 


where the {az (t)} represents the possibly time-variant attenuation factor for the L multipath 
propagation paths. If Equation (1.4.4) is substituted into Equation (1.4.3), the received 
signal has the form 


L 
r(t) = Do a(ts — %) +n). (1.4.5) 


k=] 


Hence, the received signal consists of L multipath components, where each component is 
attenuated by {a;,} and delayed by {t4}. 

The three mathematical models previously described characterize a majority of phys- 
ical channels encountered in practice. These three channel models are used in this text for 
the analysis and design of communication systems. 


1.5 SUMMARY AND FURTHER READING 


Following a brief historical review of telecommunication developments over the past two 
centuries, we presented an introduction of the basic elements of analog and digital commu- 
nication systems and described several important advances in the development of digital 
communications in the first 60 years of the twentieth century. The second part of this 
chapter focused on the characteristics of different types of wireline and wireless commu- 
nication channels, including their mathematical models which are used in the design and 
performance analysis of communication systems. 

We have already cited several historical books on radio and telecommunications pub- 
lished in the past century. These include the books by McMahon (1984), Ryder and Fink 
(1984), and Millman (1984). In addition, the classical works of Nyquist (1924), Hartley 
(1928), Kotelnikov (1947), Shannon (1948), and Hamming (1950) are particularly impor- 
tant because they lay the groundwork of modern communication systems engineering. 


Signals and Linear 
Systems 





In this chapter, we will review the basics of signals and linear systems. The motivation 
for studying these fundamental concepts stems from the basic role they play in modeling 
various types of communication systems. In particular, signals are used to transmit infor- 
mation over a communication channel. Such signals are usually called information-bearing 
signals. Speech signals, video signals, and the output of an ASCII terminal are examples 
of information-bearing signals. 

When an information-bearing signal is transmitted over a communication channel, 
the shape of the signal is changed, or distorted, by the channel. In other words, the output 
of the communication channel, which is called the received signal, is not an exact replica 
of the channel input due to many factors, including channel distortion. The communication 
channel is an example of a system, i.e., an entity that produces an output signal when 
excited by an input signal. A large number of communication channels can be modeled 
closely by a subclass of systems called linear systems. Linear systems arise naturally in 
many practical applications and are rather easy to analyze. 


2.1 BASIC CONCEPTS 


In this book, we generally deal with communication signals that are functions of time, 
i.e., time is the independent variable. Examples of such signals are audio signals (speech, 
music), video signals, and data signals. Such signals are represented as mathematical func- 
tions of the form s(t), or x(t), or f(t). As an example, a sample waveform of a speech 
signal is shown in Figure 2.1. 


2.1.1 Basic Operations on Signals 


Basic operations on signals involve time shifting, time reversal (flipping), and time scaling. 
In this section, we describe the effect of these operations on signals. 


Time Shifting. Shifting, or delaying, a signal x(t) by a given constant time fo 
results in the signal x(t — to). If tọ is positive, this action is equivalent to a delay of to; thus, 
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s(t), Speech amplitude 





0 128 256 384 512 640 Figure 2.1 A sample speech 


Time, t (msec) waveform. 





Figure 2.2 Time shifting of a signal. 


the result is a shifted version of x(t) by to to the right. If fp is negative, then the result is a 
shift to the left by an amount equal to |fo|. A plot of a signal shift for positive tọ is shown 
in Figure 2.2. 


Time Reversal. Time reversal, or flipping, of a signal results in flipping the signal 
around the vertical axis, or creating the mirror image of the plot with respect to the vertical 
axis. We can visualize this flipping of a signal as playing an audio tape in reverse. As a 
result, positive times are mapped as negative times and vice versa. In mathematical terms, 
time reversal of x(t) results in x(—t). Figure 2.3 shows this operation. 


Time Scaling. Time scaling of a signal results in a change in the time unit against 
which the signal is plotted. Time scaling results in either an expanded version of the signal 
(if the new time unit is a fraction of the original time unit) or a contracted version of the 
original signal (if the new time unit is a multiple of the original time unit). In general, time 
scaling is expressed as x(at) for some a > 0. If a < 1, then the result is an expanded 
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version of the original signal (such as a tape which is played at a slower speed than it was 
recorded). If a > 1, the result is a contracted form of the original signal (such as a tape that 
is played at a higher speed than it was recorded). The case of a > 1 is shown in Figure 2.4. 

In general, we may have a combination of these operations. For instance, x(—2r) is 
a combination of flipping the signal and then contracting it by a factor of 2. Also, x(2t — 3) 
is equal to x[2(t — 1.5)], which is equivalent to contracting the signal by a factor of 2 and 
then shifting it to the right by 1.5. 


2.1.2 Classification of Signals 


The classification of signals makes their study easier. Depending on the point of view, 
signals can be classified in a variety of ways. In this section, we present the most important 
ways to classify signals. 


Continuous-Time and Discrete-Time Signals. Based on the range of the inde- 
pendent variable, signals can be divided into two classes: continuous-time signals and 
discrete-time signals. A continuous-time signal is a signal x(t) for which the independent 
variable t takes real numbers. A discrete-time signal, denoted by x[n], is a signal for which 
the independent variable n takes its values in the set of integers. 

By sampling a continuous-time signal x(t) at time instants separated by To, we can 
define the discrete-time signal x[n] = x(nTo). Figure 2.5 shows examples of discrete-time 
and continuous-time signals. 


Example 2.1.1 
Let 


x(t) = Acos(2nfot + 0). 


This is an example of a continuous-time signal called a sinusoidal signal. A sketch of this 
signal is given in Figure 2.6. a 
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x(t) 


Figure 2.5 Examples of discrete-time and 
continuous-time signals. 


x(t) 


-3 Figure 2.6 Sinusoidal signal. 


Example 2.1.2 
Let 
x[n] = A cos(2x fon + 9), 


where n € Z (Z is the set of integers). A sketch of this discrete-time signal is given in 
Figure 2.7. n 


Real and Complex Signals. Signals are functions, and functions at a given value 
of their independent variable are just numbers, which can be either real or complex. A real 
signal takes its values in the set of real numbers, i.e., x(t) € R. A complex signal takes its 
values in the set of complex numbers, i.e., x(t) € C. 

In communications, complex signals are usually used to model signals that convey 
amplitude and phase information. Like complex numbers, a complex signal can be rep- 
resented by two real signals. These two real signals can be either the real and imaginary 
parts or the absolute value (or modulus or magnitude) and phase. A graph of a complex 
signal can be given by graphs in either of these representations. However, the magnitude 
and phase graphs are more widely used. 
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Example 2.1.3 
The signal 
x(t) = Aei Tto) 
is a complex signal. Its real part is 


x(t) = Acos(2n fot + 0) 


and its imaginary part is 
x(t) =A sin(2m fot +6), 


where we have used Euler’s relation e/* = cos¢ + j sing. We could equivalently describe 
this signal in terms of its modulus and phase. The absolute value of x(t) is 


Ix()| = yx?) + x? (t) = JAI, 


Zx(t) = 2nfott+. 


and its phase is 


Graphs of these functions are given in Figure 2.8. a 


The real and complex components, as well as the modulus and phase of any complex 
signal, are represented by the following relations: 


x(t) = |x(t)| cos (Zx(t)) , (2.1.1) 

xi(t) = |x(t)| sin (Zx(0)) , (2.1.2) 

OI = x2) + x? 0), l (2.1.3) 
x(t) 

Zx(t) = arctan ; (2.1.4) 





X(t) 
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Deterministic and Random Signals. In a deterministic signal at any time instant 
t, the value of x(t) is given as a real or a complex number. In a random (or stochastic) sig- 
nal at any given time instant t, x(t) is a random variable; i.e., it is defined by a probability 
density function. 
All of our previous examples were deterministic signals. Random signals are dis- 
cussed in Chapter 5. 


Periodic and Nonperiodic Signals. A periodic signal repeats in time; hence, it 
is sufficient to specify the signal in a basic interval called the period. More formally, a 
periodic signal is a signal x(t) that satisfies the property 


x(t + To) = x(t) 


for all t£, and some positive real number Tọ (called the period of the signal). For discrete- 
time periodic signals, we have 
x[n + No] = x[n] 


for all integers n, and a positive integer No (called the period). A signal that does not satisfy 
the conditions of periodicity is called nonperiodic. 


Example 2.1.4 
The signals 
x(t) = Acos(2n fot + 0) 


and 
x(t) = Aei far+6) 


are examples of real and complex periodic signals. The period of both signals is JT) = a The 
signal 


1 t>0 
u(t) = 0O r<0 (2.1.5) 
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Figure 2.9 The unit-step signal. 


illustrated in Figure 2.9 is an example of a nonperiodic signal. This signal is known as the 
unit-step signal. a 


Example 2.1.5 
The discrete-time sinusoidal signal shown in Figure 2.7 is not periodic for all values of fo. For 
this signal to be periodic, we must have 


2x fo(n + No) +0 = 2x fon +O0+2mn (2.1.6) 
for all integers n, some positive integer No, and some integer m. Thus, we conclude that 
2nfoNo = 2nm 


m 
t 0 = =, 
f No 
i.e., the discrete sinusoidal signal is periodic only for rational values of fo. For instance, 
A cos(3mn + 6) is periodic, but A cos(./2nn + 0) is not periodic. E 


Causal and Noncausal Signals. Causality is an important concept in classifying 
systems. This concept has a close relationship to the realizability of a system. We will 
cover this issue later, during our discussion on various types of systems. Now we define 
the concept of causal signals, which is closely related to the concept of causal systems. A 
signal x(t) is called causal if for all t < 0, we have x(t) = 0; otherwise, the signal is 
noncausal. Equivalently, a discrete-time signal is a causal signal if it is identically equal to 
zero for n < 0. 


Example 2.1.6 
The signal 


x(t) = A cos(27 fot + 0) fort > 0 
~ 10 otherwise 
is a causal signal. Its graph is shown in Figure 2.10. a 


Similarly, we can define anticausal signals as signals whose time inverse is causal. 
Therefore, an anticausal signal is identically equal to zero for t > 0. 


Even and Odd Signals. Evenness and oddness are expressions of various types 
of symmetry present in signals. A signal x(t) is even if it has mirror symmetry with respect 
to the vertical axis. A signal is odd if it is antisymmetric with respect to the vertical axis. 
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Figure 2.10 An example of a causal 
signal. 


The signal x(t) is even if and only if, for all t, 
x(-t) = x(t), 
and is odd if and only if, for all ¢, 
x(—t) = —x(t). 


Figure 2.11 shows graphs of even and odd signals. 
In general, any signal x(t) can be written as the sum of its even and odd parts as 


x(t) = Xe(t) + Xo(t), (2.1.7) 


x(t) 


x(t) 


Figure 2.11 Examples of even and 
odd signals. 
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where 
x)= eee (2.1.8) 
Xo(t) = ens (2.1.9) 


Example 2.1.7 
The sinusoidal signal x(t) = Acos(2n fot + 0) is generally neither even nor odd. However, 
the special cases 6 = 0 and 0 = +5 correspond to even and odd signals, respectively. In 
general, 


A A 
x(t) = 7 cos(@) cos(2x fot) — 7 sin(@) sin(2x fot). 
Since cos(2v fot) is even and sin(21 fot) is odd, we conclude that 
A 
Xe(t) = ci cos(@) cos(2m fot) 
and 
A, i 
x,(t) = a7 sin(@) sin(2 x fot). E 


Example 2.1.8 
From Figure 2.8, we can see that for 0 = 0 and x(t) = Ae/*"*', the real part and the magni- 
tude are even and the imaginary part and the phase are odd. a 


Hermitian Symmetry for Complex Signals. For complex signals, another form 
of symmetry, called Hermitian symmetry, is also defined. A complex signal x(t) is called 
Hermitian if its real part is even and its imaginary part is odd. In addition, we can easily 
show that its magnitude is even and its phase is odd. The signal x(t) = Ae/?*/0' is an 
example of a Hermitian signal. 


Energy-Type and Power-Type Signals. This classification deals with the energy 
content and the power content of signals. Before classifying these signals, we need to define 
the energy content (or simply the energy) and the power content (or power). 

For any signal x(t), the energy content of the signal is defined by! 


+00 T/2 
e= |x(t)| dt = lim / [x(t)|? dt. (2.1.10) 
-co T0090 -T/2 
The power content is defined by 

T/2 


1 
P, = lim — gt. 2.1.11 
eet ate oe |x(t)| dt ( ) 


For real signals, |x(t)|? is replaced by x?(t). 


'If x(t) indicates the voltage across a | Q resistor, then the current flowing through the resistor is x(t) and 
the instantaneous power is x?(t). We can justify Equation (2.1.10) by noting that energy is the integral of power. 
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A signal x (t) is an energy-type signal if and only if ©, is finite. A signal is a power- 
type signal if and only if P, satisfies 


0 < P, < œ. 
Example 2.1.9 
Find the energy in the signal described by 
raad PISE, 
O otherwise 


Solution We have : 
+00 3 

8.= | moran = | 9dt = 54. 
bas 5 


Therefore, this signal is an energy-type signal. a 


Example 2.1.10 
The energy content of A cos(21 fot + 0) is 
T/2 


Gx = lim A? cos? (2x fot + 0) dt = 00. 
T-00 -T/2 


Therefore, this signal is not an energy-type signal. However, the power of this signal is 


1 T/2 
P, = lim = / A? cos?(2n fot + 0) dt 


T/2 
1 [T2 a2 
= jim, 7 “S F [1 + cos(4x fot + 20)] dt 
AT A? ue 
= lim | —— + | —— sin(4n fot + 20 
Jim) Sr * [aan nen | 
AZ 
=— <œ (2.1.12) 
2 
Hence, x(t) is a power-type signal and its power is £. a 


Example 2.1.11 
For any periodic signal with period Tọ, the energy is 


T/2 
Z, = lim |x(t)|?dt 
T->00 -T/2 


nk 


z 
= lim f |x(t)?dt 


noo J_ aT 


+3 
= tim a f Ix(t)Pdt 


T 
2 


= oO. (2.1.13) 
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Therefore, periodic signals are not typically energy type. The power content of any periodic 
signal is 


P, li 17” 2 
x im z dt 
PAT ee Ix(2)| 


4 


nt il 2 2 
= lim — |x(t)|*dt 
n= nTo -2 
= lim — | |x) Pat 
noo NTo -9 
= zj: |x(t)|*dt. (2.1.14) 
To -2 


This means that the power content of a periodic signal is equal to the average power in one 
period. E 


2.1.3 Some Important Signals and Their Properties 


In our study of communication systems, certain signals appear frequently. In this section, 
we briefly introduce these signals and describe some of their properties. 


1 y 
The Sinusoidal Signal. The sinusoidal signal is defined by 
x(t) = Acos(2n fot + 0), 


where the parameters A, fo, and 0 are, respectively, the amplitude, frequency, and phase 
of the signal. A sinusoidal signal is periodic with the period Tọ = 1/fo. For a graph of this 
signal, see Figure 2.6. 


The Complex Exponential Signal. The complex exponential signal is defined 
by x(t) = Ae/@*Jo'+®), Again A, fo, and @ are, respectively, the amplitude, frequency, and 
phase of the signal. This signal is shown in Figure 2.8. 


The Unit-Step Signal. The unit-step signal, which is defined in Section 2.1.2, is 
another frequently encountered signal. The unit step multiplied by any signal produces a 
“causal version” of the signal. The unit-step signal is shown in Figure 2.9. Note that for 
positive a, we have u (at) = u1 (t). 


Example 2.1.12 
To plotthe signal u_, (t) +2u_,(t —1) —u_,(t—2), we note that this is a result of time shifting 
the unit-step function. The plot is shown in Figure 2.12. a 


The Rectangular Pulse. This signal is defined as 


] _1 <t< 1 
I(t) = Beat eres (2.1.15) 
0 otherwise 


The graph of the rectangular pulse is shown in Figure 2.13. 
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x(t) 4 






” Figure 2.12 The signal 
u_y (t) + Qu_\(t — 1) — u(t — 2). 


aoe e 





NIR j--~------ 
~ 


f Figure 2.13 The rectangular pulse. 
Example 2.1.13 
To plot the signal 21 (42) — T (42), we note that this signal is the difference between two 
time-shifted and time-scaled versions of I1(t). Its plot is shown in Figure 2.14. a 


The Triangular Signal. This signal is defined as 


t+1 -1<1t<0 
A(@t)=¢-—t+1 O<t<1 . (2.1.16) 
0 otherwise 


x(£) j} 





F Figure 2.14 The signal 
20 ($) - (4). 


l 

I ! I 

I { I 

l I l 

1 3 5 6 


Section 2.1 Basic Concepts : 33 


A) 
1 





Figure 2.15 The triangular signal. 


Its plot is shown in Figure 2.15. It is not difficult to verify that? 


A(t) = TI (t) x TI (t). (2.1.17) 

Example 2.1.14 
To plot M ( £) +A ( £), we use the time scaling of signals. The result is shown in Figure 2.16. 
E 


The Sinc Signal. The sinc signal is defined as 


sin(xr) t # 0 


Tt 


(2.1.18) 
1 t=0 


sinc(t) = 


The waveform corresponding to this signal is shown in Figure 2.17. From this figure, we 
can see that the sinc signal achieves its maximum of 1 at t = 0. The zeros of the sinc signal 
are at t = +1, +2, +3,.... 


x(t) |. 


Figure 2.16 The signal 
T (4) + A (5). 





Note that x(t) * y(t) denotes the convolution of two signals, which is defined by 


+00 +00 


x(t) x y(t) = [ x(t)y(t —t)dt = f x(t — t)y(t)dt = y(t) x x(t). 


= -co 


For more details, see Section 2.1.5. 
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sinc (t) į 
1 





Figure 2.17 The sinc signal. 


_ The Sign or the Signum Signal. The sign or the signum signal denotes the sign 
of the independent variable ¢ and is defined by 


1 t>0 
sen(t) = 4-1 t<0. (2.1.19) 
0 t=0 


This signal is shown in Figure 2.18. The signum signal can be expressed as the limit of the 
signal x,(¢), which is defined by 


t 
Ta t>0 


e 
Xn (t) = 4 —en t<0, (2.1.20) 
0 


when n —> oo. We will later use this definition of the signum signal to find its Fourier 
transform. This limiting behavior is also shown in Figure 2.18. 


Figure 2.18 The signum signal as the limit of 
Xn(t). 
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The Impulse or Delta Signal. The impulse or delta signal is a mathematical 
model for representing physical phenomena that occur in a very small time duration, are 
so small that they are beyond the resolution of the measuring instruments involved, and for 
all practical purposes, have a duration that can be assumed to be equal to zero. Examples 
of such phenomena are a hammer blow, a very narrow voltage or current pulse, and so on. 
In the precise mathematical sense, the impulse signal ô(t) is not a function (or signal)—it 
is a distribution or a generalized function. A distribution is defined in terms of its effect on 
another function (usually called the “test function”) under the integral sign. The impulse 
distribution (or signal) can be defined by the relation 


+00 
f $(¢)d(t) dt = $ (0), (2.1.21) 


which expresses the effect of the impulse distribution on the “test function” ¢ġ (t), assumed 
to be continuous at the origin. This property is called the sifting property of the impulse 
signal. In other words, the effect of the impulse signal on the “test function” ¢ (t) under the 
integral sign is to extract or sift its value at the origin. As shown, 4(¢) is defined in terms of 
its action on ¢ (t) and not defined in terms of its value for different values of t. 

Sometimes it is helpful to visualize 5(¢) as the limit of certain known signals. The 
most commonly used forms are 


1 t 
6(t) = lim -M (z) (2.1.22) 
€j0 € € 
and 
TE ES t 
ô(t) = lim —sinc (:) ‘ (2.1.23) 
€10 € € 


Figure 2.19 shows graphs of these signals. (The symbol e | 0 means that € tends to zero 
from above, i.e., it remains positive.) 
The following properties are derived from the definition of the impulse signal: 


1. 8(t) = 0 for all £ # 0 and 8(0) = œ. 
2. x(t)d(t — to) = x(to) b(t — to). 
3. For any ¢(t) continuous at fo, 


j co. (t — to) dt = $ (to). (2.1.24) 
4. For any ¢ (t) continuous at to, 
J “04 + to)ô (t) dt = ¢ (to). (2.1.25) 
5. For all a £0, 
s(at) = L304). (2.1.26) 


lal 
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Figure 2.19 The impulse signal as a limit. 


6. Theresult of the convolution of any signal with the impulse signal is the signal itself: 
x(t) x O(t) = x(t). (2.1.27) 

Also, 
x(t) * 6(t — to) = x(t — to). (2.1.28) 
7. The unit-step signal is the integral of the Sites signal, and the impulse signal is 


the generalized derivative of the unit-step signal, i.e., 


t 
u(t) = f 8(t) dt (2.1.29) 


—00 


and 


d 
8) = Fu. (2.1.30) 
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8. Similar to the way we defined 5(t), we can define 6’(t), 8” (t), ..., 8™® (t), the gen- 


eralized derivatives of 5(t), by the following equation: 


+00 d" 
i: 5 (t)(t) dt = CDir P| i (2.1.31) 
—00 dt" t=0 
We can generalize this result to 
+00 d" 
f aO- w)G(t)dt = (-1)" ob) - (2.1.32) 
—0o dat” t=t0 3 


. The result of the convolution of any signal with nth derivative of x(t) is the nth 


derivative of x (t), i.e., 
x(t) x 8O) = x(t) (2.1.33) 


and in particular 
x(t) xô (t) = x' (t). (2.1.34) 


10. The result of the convolution of any signal x (t) with the unit-step signal is the integral 


11. 


of the signal x(t), i.e., 
t 


rena = | x(t) dt. (2.1.35) 


For even values of n, 5“) (t) is even; for odd values of n, it is odd. In particular, 5(t) 
is even and 6’(t) is odd. 


A schematic representation of the impulse signal is given in Figure 2.20, where the 


integral of 5(t) over all time (—oo < t < 00) is unity. 


Example 2.1.15 


Determine (cost)(t), (cost)d(2t — 3), and JS e™s'(t — 1) dt. 
Solution To determine (cost)ô(t), we can use Property 2: 
(cost)d(t) = (cos 0)ô(t) = ô (t). 


To determine (cos t)5(2t — 3), we can use Property 5: 


sar ~3) = 58(#-3). 





0 t Figure2.20 The impulse signal. 
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Then, from Property 1, we have 


1 3 cos 1.5 3 3 
= = == l ô —-- |U. —-]. 
(cos t)5(2t — 3) z (cos t)ô (: >) 7 (: >) 0.0358 (: >) 





Finally, to determine JES e™ô' (t — 1) dt, we use Property 8 to obtain 


foe} ; d 
e'8(t—1)dt =(-1)—e"", =e! 
T. (t— 1) (Dz | e. m 


t=1 


2.1.4 Classification of Systems 


A system is an interconnection of various elements or devices that, from a certain view- 
point, behave as a whole. From a communication point of view, a system is an entity that 
is excited by an input signal and, as a result of this excitation, produces an output signal. 
From a communication engineer’s point of view, a system is a law that assigns output sig- 
nals to various input signals. For example, an electric circuit with some voltage source as 
the input and some current in a certain branch is a system. The most important point in the 
definition of a system is that its output must be uniquely defined for any legitimate input. 
This definition can be written mathematically as 


y(t) = FA], (2.1.36) 


where x(t) is the input, y(¢) is the output, and J is the operation performed by the system. 
Figure 2.21 shows a pictorial representation of a system. 


Example 2.1.16 
The input-output relationship y(t) = 3x(t) + 3x?(t) defines a system. For any input x(t), the 
output y(t) is uniquely determined. | 


A system is defined by two characteristics: (1) the operation that describes the system and 
(2) the set of legitimate input signals. In this text, we will use the operator J to denote the 
operation that describes the system. We will use Æ to denote the space of legitimate inputs 
to the system. 


Example 2.1.17 
The system described by the input-output relationship 


d 
yt) = F[x(t)] = gO (2.1.37) 


for which Æ is the space of all differentiable signals, describes a system. This system is 
referred to as the differentiator. a 


The space %Æ is usually defined by the system operation; therefore, it is not usually given 
explicitly. 


Figure 2.21 A system with an input and output. 
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As in the case of signals, systems can be classified according to their properties. 
Based on this point of view, various classifications are possible. We will now briefly intro- 
duce some of the fundamental system classifications. 


Discrete-Time and Continuous-Time Systems. Systems are defined by the 
operation that the input signals use to produce the corresponding output signal. Systems 
can accept either discrete-time or continuous-time signals as their inputs and outputs. This 
is the basis of system classification into continuous-time and discrete-time systems. 

A discrete-time system accepts discrete-time signals as the input and produces discrete- 
time signals as the output. For a continuous-time system, both input and output signals are 
continuous-time signals. 


Example 2.1.18 


The systems described in the two previous examples are both continuous-time systems. An 
example of a discrete-time system is the system described by 


yin] = x[n] — x[n — 1]. (2.1.38) 


This system is a discrete-time differentiator. E 


Linear and Nonlinear Systems. Linear systems are systems for which the super- 
position property is satisfied, i.e., the system’s response to a linear combination of the 
inputs is the linear combination of the responses to the corresponding inputs. 

A system J is linear if and only if, for any two input signals x; (t) and x2(t) and for 
any two scalars œ and £, we have 


F [ax (t) + Bx2(t)] = aF [1] + BF [2]. (2.1.39) 


A system that does not satisfy this relationship is called nonlinear. 
Linearity can also be defined in terms of the following two properties: 
T t =J T t 
[x1 (t) + x2(t)] = F l+ F xlt] l (2.1.40) 
T [æx(t)] - =a F [x(t)] 


A system that satisfies the first property is called additive, and a system that satisfies 
the second property is called homogeneous. From the second property, it is obvious that 
J [0] = 0 in a linear system. In other words, the response of a linear system to a zero input 
is always zero (for linearity, this is a necessary but not a sufficient condition). 

Linearity is a very important property. In a linear system, we can decompose the 
input into a linear combination of some fundamental signals whose output can be derived 
easily, and then we can find the linear combination of the corresponding outputs. We denote 
the operation of linear systems by £, rather than by J. 


Example 2.1.19 
The differentiator described earlier is an example of a linear system. This is true because if 


x(t) and x2(t) are differentiable, œxı (t) + 6x2(t) must also be differentiable for any choice 
of a and £, and 


£ [axi(t) + Br2(0)) = axi (D) + pxl). 
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The system described by 
y(t) = ax? (t) 


is nonlinear because its response to 2x(t) is 
T [2x(t)] = 4x° (t) # 2x? (t) = 2F [x(0)]; 


therefore, the system is not homogeneous. 


Example 2.1.20 


A delay system is defined by y(t) = x(t — A), i.e., the output is a delayed version of the input. 
(See Figure 2.22.) If x(t) = ax, (t) + Bx2(t), then the response of the system is obviously 
ax,(t — A) + x(t — A). Therefore, the system is Jinear. r 


Time-Invariant and Time-Varying Systems. A system is called time invari- 
ant if its input—output relationship does not change with time. This means that a delayed 
version of an input results in a delayed version of the output. 

A system is time invariant if and only if, for all x(t) and all values of fo, its response 
to x(t — to) is y(t — tg), where y(t) is the response of the system to x(t). (See Figure 2.23.) 


Example 2.1.21 
The differentiator is a time-invariant system, since 


d 
ae — to) = x(O liir z i 


Example 2.1.22 


The modulator, defined by y(t) = x(t) cos 2x fot, is an example of a time-varying system. 
The response ofthis system to x (t — fo) is 


x(t — to) cos(2 fot), 


which is not equal to y(t — to). E 


x(t) — a |- y( =x(t— A) Figure2.22 The input-output relation for the delay 
system. 


- HD y pes E E es 


Figure2.23 A time-invariant system. 


to 
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The class of linear time-invariant (LTI) systems is particularly important. The response 
of these systems to inputs can be derived simply by finding the convolution of the input 
and the impulse response of the system. We will discuss this property in more detail in 
Section 2.1.5. 


Causal and Noncausal Systems. Causality deals with the physical realizability 
of systems. Since no physical system can predict the values that its input signal will assume 
in the future, we can assume that in a physically realizable system, the output at any time 
depends only on the values of the input signal up to that time and does not depend on the 
future values of the input. 

A system is causal if its output at any time tọ depends on the input at times prior to 
to, 1.€., 


y(to) = F[x(t) : t < to]. 


A necessary and sufficient condition for an LTI system to be causal is that its impulse 
response A(t) (i.e., the output when the input is ô(t), see Section 2.1.5) must be a causal 
signal, i.e., for £ < 0, we must have h(t) = 0. For noncausal systems, the value of the 
output at fo also depends on the values of the input at times after tọ. Noncausal systems are 
encountered in situations where signals are not processed in real time? 


Example 2.1.23 
A differentiator is an example of a causal system since it is LTI and its impulse response, 
h(t) = 6’(t), is zero for t < 0. A modulator is a causal but time-varying system since its 
output, y(t) = x(t) cos(21 fot), at time t depends on the value of the input at time ¢ and not 
on future values of x(t). The delay system defined in Example 2.1.20 is causal for A > 0 and 
noncausal for A < 0. (Why?), since its impulse response ô(t — A) is zero fort <Oif A >0 
and nonzero if A < 0. C 


2.1.5 Analysis of LTI Systems in the Time Domain 


The class of LTI systems plays an important role both in communication and system theory. 
For this class of systems, the input—output relationship is particularly simple and can be 
expressed in terms of the convolution integral. To develop this relationship, we first intro- 
duce the concept of the impulse response of a system. 

The impulse response h(t) of a system is the response of the system to a unit impulse 
input ô (t): 


h(t) = F [60]. 


The response of the system to a unit impulse applied at time T, i.e., ô(t — T), is denoted by 
h(t, T). Obviously, for time-invariant systems, h(t, t) = h(t — t).4 


3For instance, when the entire signal is recorded and then processed. In such a case when processing the 
signal at time fọ we have access to its future values. 

“Note that this notation is sloppy. It uses A to denote two different functions: A(t, t), which is a function 
of two variables, and A(t), which is a function of one variable. 


42 Signals and Linear Systems Chapter 2 


The Convolution Integral. Now, we will derive the output y(t) of an LTI system 
to any input signal x(t). We will show that y(t) can be expressed in terms of the input x (t) 
and the impulse response A(t) of the system. 
In Section 2.1.3 we showed that, for any signal x(t), we have 


+00 


x(t) = x(t) x ô(t) =/ x(t)d(t — tT) dt. (2.1.41) 


—00 


Now, if we denote the response of the LTI system to the input x(t) by y(t), we can write 


y(t) = £[x(t)] 


+00 i 
=s[f soe- 2) dr] 


+00 


= x(t) L [S(t —t)] dt 


joe) 


b +00 
af x(t)h(t —t)dt 


oe) 


= x(t) * h(t), (2.1.42) 


where (a) follows from the linearity of the system (note that an integral is basically the 
limit of a sum) and (b) follows from the time invariance. This shows that the response to 
x(t) is the convolution of x(t) and the impulse response h(t). Therefore, for the class of 
LTI systems, the impulse response completely characterizes the system. This means that 
the impulse response contains all the information we need to describe the system behavior. 


Example 2.1.24 
The system described by 


yt) = i x(t) dt (2.1.43) 


is called an integrator. Since integration is linear, this system also is linear. Also, the response 
to x(t — to) is 


no = f x(t — to) dt 


t—to 
= / x(u) du 
= y(t — to), (2.1.44) 


where we have used the change of variables u = t — tọ. We can see that the system is LTI. - 
The impulse response is obtained by applying an impulse at the input, i.e., 


h(t) = iA 6(t) dt = u(t). a 


foe) 
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Example 2.1.25 


Let an LTI system have the impulse response h(t). Assume that this system has a com- 
plex exponential signal as input, i.e., x(t) = Ae/@*/0'+*). The response to this input can 


be obtained by 
+00 , 
y(t) = / h(t) Aei- + ge 
-00 
+ 
= aaen ” pajeg 
—00 
= AJH (fo)e CTL), (2.1.45) 
where 
. too PY 
H fo) =E = [nye Pra, (2.1.46) 
—00 


This shows that the response of an LTI system to the complex exponential with frequency 
fo is a complex exponential with the same frequency. The amplitude of the response can be 
obtained by multiplying the amplitude of the input by |H (fo)|, and its phase is obtained by 
adding 2H (fo) to the input phase. Note that H (fo) is a function of the impulse response and 
the input frequency. Because of this property, complex exponentials are called eigenfunctions 
of the class of LTI systems. The eigenfunctions of a system are the set of inputs for which the 
output is a scaling of the input. Because of this important property, finding the response of 
LTI systems to the class of complex exponential signals is particularly simple; therefore, it is 
desirable to find ways to express arbitrary signals in terms of complex exponentials. We will 
later explore ways to do this. = 


2.2 FOURIER SERIES 
A large number of building blocks in a communication system can be modeled by LTI 
systems. LTI systems provide good and accurate models for a large class of communica- 
tion channels. Also, some basic components of transmitters and receivers, such as filters, 
amplifiers, and equalizers, are LTI systems. 
Our main objective is to develop methods and tools necessary to analyze LTI systems. 
When analyzing a system, we want to determine the output corresponding to a given input 
and, at the same time, provide insight into the behavior of the system. We have already 
seen that the input and output of an LTI system are related by the convolution integral, 
given by > 


+00 +00 
y(t) = 'h(t)x(t — tr) dt =| h(t — t)x(t) dt, (2.2.1) 
—00 —00 

where h(t) denotes the impulse response of the system. The convolution integral provides 
the basic tool for analyzing LTI systems. However, there are major drawbacks in the direct 
application of the convolution integral. First, using the convolution integral to find the 
response of an LTI system may be straightforward, but it is not always an easy task. Second, 
even when the convolution integral can be performed with reasonable effort, it may not 
provide good insight into how the system behaves in response to other input signals. 
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In the next two sections, we will develop another approach to analyzing LTI systems. 
The basic idea is to expand the input as a linear combination of some basic signals whose 
output can be easily obtained, and then to employ the linearity properties of the system to 
obtain the corresponding output. This approach is much easier than a direct computation 
of the convolution integral; at the same time, it provides better insight into the behavior of 
LTI systems. This method is based on the close similarity between the expansion of signals 
in terms of a basic signal set and the expansion of vectors in Euclidean space in terms of 
unit vectors. 


2.2.1 Fourier Series and Its Properties 


The set of complex exponentials are the eigenfunctions of LTI systems. The response of 
an LTI system to a complex exponential is a complex exponential with the same frequency 
with a change in amplitude and phase. Example 2.1.25 showed that the change in phase 
and amplitude are functions of the frequency of the complex exponential and the impulse 
response of the LTI system. So, which signals can be expanded in terms of complex expo- 
nentials? To answer this question, we will give the conditions for a periodic signal to be 
expandable in terms of complex exponentials. The expansion of nonperiodic signals will 
be discussed later. 

Let the signal x(t) be a periodic signal with period Tọ. First, we need to determine 
whether the following Dirichlet conditions are satisfied: 


1. x(t) is absolutely integrable over its period, i.e., 


To 
Í |x(t)|dt < 00, 
0 


2. The number of maxima and minima of x(t) in each period is finite, 
3. The number of discontinuities of x(t) in each period is finite. 


If these conditions are met, then x (t) can be expanded in terms of the complex exponential 
: j2n Ft) +00 
signals {e 70 } as 


n=—00 
+00 ed 
x)= Y h", (2.2.2) 
n=—00 
where 
1 a+To -jnt 
Xn = a x(t)e To dt (2.2.3) 
0 Ja 


for some arbitrary œ. 
Some observations concerning this theorem are as follows: 


° The coefficients x, are called the Fourier-series coefficients of the signal x(t). These 
are generally complex numbers (even when x(t) is a real signal). 

° The parameter œ in the limits of the integral is arbitrary. It can be chosen to simplify 
the computation of the integral. Usually, œ = 0 or a = —To/2 are good choices. 
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The Dirichlet conditions are sufficient conditions for the existence of the Fourier- 
series expansion. For some signals that do not satisfy these conditions, we can still 
find the Fourier-series expansion. 

The quantity fo = $ is called the fundamental frequency of the signal x(t). We 
observe that the frequencies of the complex exponential signals are multiples of this 
fundamental frequency. The nth multiple of fo is called the nth harmonic. 
Conceptually, this is a very important result. It states that the periodic signal x(t) can 
be described by the period Tọ (or the fundamental frequency fo) and the sequence 
of complex numbers {xn}. Thus, to describe x(t), we may specify a countable set 
of complex numbers. This considerably reduces the complexity of describing x(t), 
since to define x(t) for all values of t, we have to specify its values on an uncount- 
able set of points. 

The Fourier-series expansion can be expressed in terms of the angular frequency 
wy = 27 fo by 


a4 28 
in = = a (peet dt (2.2.4) 
a 
and 

+00 
x)= >> xe, (2.2.5) 

n=—0o 
In general, xn = [xnle/ Zen, Thus, |x,| gives the magnitude of the nth harmonic 


and 2x, gives its phase. Figure 2.24 shows a graph of the magnitude and phase of 
various harmonics in x(t). This type of graph is called the discrete spectrum of the 
periodic signal x (t). 


Figure 2.24 The discrete spectrum of x(t). 
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Example 2.2.1 
Let x(t) denote the periodic signal depicted in Figure 2.25 and described analytically by 


aus t-n 
w=) n( : ): (2.2.6) 


n=—00 





where T is a given positive constant (pulse width). Determine the Fourier-series expansion for 
this signal. g 


Solution We first observe that the period of the signal is To and 


] +2 — jng Se 
= zI x(t)e "n dt” 
TE a 


=> 1e" n dt 
To J-5 
1 To [ inte Hee 
= — 0 — 0 
To —jn27 : ia 
1 NIT 
= — SI —— 0 
nn 5 ( To ) Le 
T T f 
= nos (F) n #0, (2.2.7) 
where we have used the relation sing = eee For n = 0, the integration is very simple 
and yields x9 = Te Therefore, 
+00 
T nt in ont 
x(t) = =si — je", 2.2.8 
(t) D Fine (Fe ( ) 
n=—0O 
A graph of these Fourier-series coefficients is shown in Figure 2.26. a 
Example 2.2.2 
Determine the Fourier-series expansion for the signal x(t) shown in Figure 2.27 and des- 
cribed by 
+00 
x(t)= J ODE- n). (2.2.9) 
n=—00 





T To “Figure 2.25 Periodic signal x(t) in 
2 2 Equation (2.2.6). 
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Figure 2.26 The discrete spectrum of 
the rectangular-pulse train. 









—5/2 


-1 Figure 2.27 Signal x(t) in Equation (2.2.9). 


Solution Since Tọ = 2, it is convenient to choose a = — 5. First, we note that for n = 0, we 
can easily find the integral to be zero; therefore x» = 0. For n 4 0, we have 


3 


1 f2 F 
y= >/ x(t)eJ"™ dt 
25-4 s 


1r, iro 
=> edt — — e"™ dt 
2 f, 2 Í 


2 








1 nt 1 ; nT 
= —~sin{ —) — a gn ( — 
= —sin(>) aoe sin(=) 

1 . (nt 
= tl — cos(nm)) sin ( >) 

4 n=4k+1 
=}-2 n=4k4+3. (2.2.10) 


0 neven 
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From these values of xn, we have the following Fourier-series expansion for x(t): 


E ; 2 pj é Di yi F 
x(t) — Z (i™ +e/™) = oe (e3™" + et) + = (ei™ + em) ee 


4 4 4 
= —cos(mt) — — cos(3mt) + —cos(S5nt) —--- 
T Sr 


3x 
4 pt 
25 os ——— cos[(2k + 1) x2]. (2.2.11) 
T 2k+1 
k=0 
E 
Example 2.2.3 
Determine the Fourier-series representation of an impulse train denoted by 
+00 
x(t) = D 5(t — nTo) (2.2.12) 
n=—0o0 
and shown in Figure 2.28. 
Solution We have 
+9 ee 
PEEN r3 x(t)e To'dt 
To J-2 
E ne ee 
=— 3(t)e 7" %' dt 
To -9 
=e. (2.2.13) 
=h 2, 
With these coefficients, we have the following expansion: 
+00 1 $ 3 
Ea j2nžt 
X t-n) = 7A ye, (2.2.14) 
n=—0O n=—0O 
This is a very useful relation, and we will employ it frequently. a 


x(4) 





Figure 2.28 An impulse train. 
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Positive and Negative Frequencies. We have seen that the Fourier-series expan- 
sion of a periodic signal x(t) is expressed as 


foe} A 
j2n Ft 
x(t) = J dne TT, 


n=—~00 


in whichall positive and negative multiples of the fundamental frequency 7 are present. A 


positive frequency corresponds to a term of the form e/” (for a positive œ), and a negative 
frequency corresponds to e~/“!. The term e/“ corresponds to a phasor rotating counter- 
clockwise at an angular frequency of w, and e~/” corresponds to a phasor rotating clock- 
wise at the same angular frequency. A plot of these two phasors is shown in Figure 2.29. 

Note that if the two signals et and e—J¢! are added, their sum is 2.cos wt, which is 
a real signal with two frequency components at +5". We will soon see that this property 
holds for all real signals; i.e., in real signals, frequencies appear in positive and negative 
pairs with complex-valued amplitudes that are conjugates. 


Fourier Series for Real Signals. If the signal x(¢) is a real signal satisfying the 
conditions of the Fourier-series theorem, then there must be alternative ways to expand the 
signal. For real x(t), we have 


ll 
taj 
~ 
A 
VY 
m 
SA 
N 
a 
` 
a 
~ 


Xn 
To a 


ll 
1 
S|- 
~ 

+ 
Ey 
=x 
= 

GA 

w 

Q 

S] 

` 

a 

~~ 
Uy 

* 


= xe (2.2.15) 


This means that for real x (t), the positive and negative coefficients are conjugates. Hence, 
|xn| has even symmetry (|x,| = |x — n|) and 2x, has odd symmetry (2x, = —Zx_,) with 
respect to the n = 0 axis. An example of the discrete spectrum for a real signal is shown in 
Figure 2.30. Š 


Figure 2.29 Phasors representing positive and negative 
frequencies. 
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è Figure 2.30 Discrete spectrum of a 
real-valued signal. 
From x_, = x}, it follows that if we denote 
an — jbn 
A 
then 
an + jbn 
G 


therefore, forn > 1, 


j2n det —jan4t An — JDn janet ant Ibn —j2n#-t 
xne TT xne TT = Zoe To + E To 


= An COS (2x2) + by, sin (2x5) ; 


Since xo is real and given as x9 = 3, we conclude that 


foe) 
x(t) = S + 2 E cos (25t) + b, sin (2x2) . (2.2.16) 


This relation, which only holds for real periodic signals, is called the trigonometric Fourier- 
series expansion. To obtain a, and bn, we have 


G@n— jb, 1 eth -j2n ft 
Xn = FS — x(t)e 0 dt; 
i 2 To Ja 6) 
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therefore, 


. +H ° at+To 
Qn —jbn _ 1 [* x) slir A Ge | x(t) sin (2x21) dt. 
2 To a To To 2 To 


Thus, we obtain 


æa+To 
P x(t) cos (2x21) dt, (2.2.17) 
0 


To a 
2 at+To ( ) 2 n ) d (2.2 18) 
a t —t } dt. 2, 
bn =f x sin ( 7 





A third way exists to represent the Fourier-series expansion of a real signal. Note that 


——— 


xpe + Mie = 2}x,| cos (orit + ixn) : (2.2.19) 
0 
Substituting Equation (2.2.19) in Equation (2.2.2), we have 


oe n 
x(t) = xo + 25 Xp] co (27 T + Ln) . (2.2.20) 


n=1 3 


In summary, for a real periodic signal x(t), we have three alternatives to represent 
the Fourier-series expansion 


x= >> ge hh (2.2.21) 
Zee 3 an COS (2x21) + b, sin (2n2+)| (2.2.22) 
2 ae > To To 
2 n 
= x9 +2 2 |x, | cos (2r + Lx») ‘ (2.2.23) 


where the corresponding coefficients are obtained from 


1 at+To bn 





- => Pah ae = 2j 2.2.24 
Xn = 7J. x(t)e o dt > J 7’ ( ) 
2 a+To n 
an = af x(t) cos (2x71) dt, (2.2.25) 
a+To 
bes Z | x(@)sin (2x21) dt, (2.2.26) 
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1 
[xn | = zvan + b?, (2.2.27) 


LXn = — arctan (>) : (2.2.28) 


ay 
Example 2.2.4 
Determine the sine and cosine coefficients in Example 2.2.1. 


Solution We have previously seen that 


T nt Qn — jbn 
Xn = — sinc | — ) = 
To To = 2 
Therefore, 
= 7 sinc (£) 
b, = 
and 








= |2t cine (mE 
|x,| = 2 sinc (£) - 
Lx, =Oorn 


Fourier-Series Expansion for Even and Odd Signals. If, in addition to being 
real, a signal is either even or odd, then the Fourier-series expansion can be further simpli- 
fied. For even x(t), we have 


pee ? (t)sin( 2x—t) dt =0 (2.2.29) 
ale sin Th =0. .2. 


Since x(t) sin(2x wt) is the product of an even and an odd signal, it will be odd and its 
integral will be zero. Therefore, for even signals, the Fourier-series expansion has only 
cosine terms, i.e., we have 


oe) 
x(t) = S + J an cos (2x21) l (2.2.30) 


n=1 0 
Equivalently, since x, = tei we conclude that for an even signal, every x, is real (or 
all phases are either 0 or x, depending on the sign of xn). 
For odd signals, we can conclude in a similar way that every a, vanishes; there- 
fore, the Fourier-series expansion only contains the sine terms or, equivalently, every x, is 
imaginary. In this case, we have 


x(t) =), sin (2m1) (2.2.31) 


n=1 


Example 2.2.5 
Assuming Ty = 2, determine the Fourier-series expansion of the signal shown in Figure 2.31. 
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Figure 2.31 An example of an 
odd-harmonic signal. 





Solution For n = 0, we can easily verify that xo = 0. For n 4 0, we have 


1 Ig 
2 _~j2xn# 
m=z f xte ?"% dt 
To J_% 


1 fi ; 
= 5 [sem dt 


1 0 . +1 , 
=- [y (=t — Del™ dt +f seian , 
2 [J-i 0 








Using 
1 1 
te” dt = —te“ — —e™, 
a a? 
we have 
tem dt = L peim 4 l gim, 
nn n?n? 
therefore, 
J —junt : 1 —junt 3 
X, = —->— te e 
2nn -) 2n?n? 1 
j 0 j 1 1 1 
a -2 eT inn te im + eiT" 
2nn Hi t 2nn 9 2n? 0 


(cos(xn) — 1) (a3 + x) 
T 


n? 22nn 


_ | -2-4 nod 
0 


n even 
Noting that x, = on tee we have 
— a 
oe =z, nodd 
= 
0 neven 


and 


2 nodd 
b, = : 
0 neven 
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The Fourier-series expansion is 


o0 


2 ; 4 
x(t) = > lao sin(2n + 1)mt — (Qn b 1? cos(2n + Dar! ; 


n=0 
Obviously, this signal contains only the odd harmonics. No even harmonics are present in the 
Fourier-series expansion of this signal. m 


2.2.2 Response of LTI Systems to Periodic Signals 


As we have already observed, the response of an LTI system to a complex exponential is 
a complex exponential with the same frequency and a change in amplitude and phase. In 
particular, if h(t) is the impulse response of the system, then from Example 2.1.25, we 
know that the response to the exponential ¢/2"/0' is H(fo)e/?™/0', where 


+00 
H(f) = / hei "dt. 


Now let us assume that x(t), the input to the LTI system, is periodic with period Tọ and has 
a Fourier-series representation 


+00 sai 
At 
x(t) = > xpel TTo", 
n=—00 


Then we have 


y(t) = ix (t)] 


n nt 
= mH (=) oh, (2.2.32) 


where 


+00 
H(f)= J hee "dt. 


ioe) 
From this relation, we can draw the following conclusions: 
e Ifthe input to an LTI system is periodic with period Tọ, then the output is also peri- 


odic. (What is the period of the output?) The output has a Fourier-series expansion 
given by 


+00 ar 
y(t)= > ye", 


n=—oo 
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where 


This is equivalent to 


lynl = |x*nl - 





(a) 


n 
Ly, = Lxn + LH | — }. 
Í i (3) 





and 


* Only the frequency components that are present at the input can be present at the 
output. This means that an LTI system cannot introduce new frequency components 
in the output, if these components are different from those already present at the 
input. In other words, all systems capable of introducing new frequency components 
are either nonlinear and/or time varying. 


° The amount of change in amplitude |a (+) and phase ¿H (+) are functions of 
n, the harmonic order, and h(t), the impulse response of the system. The function 


+00 

H(f)= l hitje idt (2.2.33) 
=00 

is called the frequency response or frequency characteristics of the LTI system. In 

general, H (f) is a complex function that can be described by its magnitude |H (f)| 

and phase ZH (f). The function H (f), or equivalently A(t), is the only information 

needed to find the output of an LTI system for a given periodic input. 


Example 2.2.6 


Let x(t) denote the signal shown in Figure 2.27, but set the period equal to Tọ = 1075 sec. 
This signal is passed through a filter with the frequency response depicted in Figure 2.32. 
Determine the output of the filter. 


Solution We first start with the Fourier-series expansion of the input. This can be easily 
obtained as 





4 & -1 
x(t) = = L E FT cos (20 (2n + 1)10°t) 


n=! 


2 
= —e 
T 


2 


; 5 ; S 
j2n10°t ae = e7J2n10 t 
T 


2 eimir 2 j6x10°r 
3x 3x 


4 2 pilonio% + 2 -i1010 EEN (2.2.34) 
5x Sx 
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600 f (kHz) 





Figure 2.32 Frequency response of the filter. 


To find the output corresponding to each frequency, we must multiply the coefficient of each 
frequency component by the H (f) corresponding to that frequency. Therefore, we have 


H(10°) = lei? =e/?, 
H(—10°) = le? = e742, 

H(3 x 10°) = 3e/7, 
H(-3 x 10°) = 3e77?, 

H(5 x 10°) = Se/?, 
H(—5 x 10°) = 5e~/2. (2.2.35) 

For higher frequencies, H (f) = 0. Therefore, we have 
y(t) = 2 jOm) + 2 pj) 
x x 


_ 2 ,K6m10%+3) = 2 jOm) 


T T 
42 pitoni0%+§) +4 2 5 i(—10x105-§) 
T T 


or, equivalently, 


4 $ 4, 5 4 5 
y(t) = —— sin(2x10°t) + — sin(6x10°t) — — sin(10x10°t). (2.2.36) 
1 1 x E 


2.2.3 Parseval’s Relation 


Parseval’s relation says that the power content of a periodic signal is the sum of the power 
contents of its components in the Fourier-series representation of that signal. This relation 
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is a consequence of the orthogonality of the basis used for the Fourier-series expansion, 
i.e., the exponential signals. 

Let us assume that the Fourier-series representation of the periodic signal x(t) is 
given by 


+00 sacl 
x(t) = 5 xpel To". 
n=—00 
Then, the complex conjugate of both sides of this relation is 
+00 MEF 
x*(t) = y xte TN, 
n=—0o 


By multiplying the two equations, we obtain 


+00 
Ora >. x R 


n=—00 m=—00 





Next, we integrate both sides over one period and note 


a+h . _ = 
joniz _JfIpn=m 
e 0 dt = Tom =| ; 
| ms 0 ngm 


Thus, 


a+7o +o +00 
[ora = Y Y xti Tab 
a 


n=—O m=- 


+00 
=m $. af (2.2.37) 
n=—00 
or, equivalently, 
1 a+To f +00 
= f kdt = $, xb. (2.2.38) 
To a n=—0o 


This is the formal statement of Parseval’s relation. Note that by Equation (2.1.14) the left- 
hand side of this relation is P,, the power content of the signal x(t), and |x, |? is the power 


jon 
content of xe % , the nth harmonic. Therefore, Parseval’s relation says that the power 
content of the peiiodic signal is the sum of the power contents of its harmonics. 

If we substitute x, = an—Jbn in Parseval’s relation, we obtain 


2 
1 ett a2 1 & 
P, = — edt = 2 +- b?). 2.2.39 
3 F kOPd = 7 +32, (an +a) (2.2.39) 


Since the power content of a, cos (2x +) and b,, sin (2x +) are % 2 and ŽA a. respectively, 
we see that the power content of x (t) is the sum of the power She of its harmonics. 
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Example 2.2.7 
Determine the power contents of the input and output signals in Example 2.2.6. 


Solution We have 
1 få a af 3 
Py== | kPa = E f vae+ [tae | =1. 
Adeg 2 Ls l 


We could employ Parseval’s relation to obtain the same result. To do this, we have 


Equating the two relations, we obtain 


> Ty aS x? 
=a 
ra (2k + 1) 8 
To find the output power, we have 
ie ae 2 b 1716 16 16] 24 
P= dt = = = -| — + — + — | = —. L] 
2 al g a; TEREE x? 


n=l 


2.3 FOURIER TRANSFORM 


2.3.1 From Fourier Series to Fourier Transforms 


As we have observed, the Fourier series may be used to represent a periodic signal in 
terms of complex exponentials. This Fourier-series representation considerably decreases 
the complexity of the description of the periodic signal. Simultaneously, the series rep- 
resentation in terms of complex exponentials is particularly useful when analyzing LTI 
systems. This is because the complex exponentials are the eigenfunctions of LTI systems. 
Equivalent to any periodic signal is the sequence {x,} of the Fourier-series ex pansion coef- 
ficients which, together with the fundamental frequency fo, completely describe the signal. 

In this section, we will apply the Fourier-series representation to nonperiodic signals. 
We will see that it is still possible to expand a nonperiodic signal in terms of complex 
exponentials. However, the resulting spectrum will no longer be discrete. In other words, 
the spectrum of nonperiodic signals covers a continuous range of frequencies. This result 
is the well-known Fourier transform given next. 

First, the signal x(t) must satisfy the following Dirichlet conditions: 


1. x(t) is absolutely integrable on the real line; i.e., 
+00 
/ |x(t)|dt < oo. 
—00 


2. The number of maxima and minima of x(t) in any finite interval on the real line is 
finite. 


3. The number of discontinuities of x(t) in any finite interval on the real line is finite. 
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Then the Fourier transform (or Fourier integral) of x(t), defined by 


+00 
X(f)= / x(the "dt, (2.3.1) 


oe) 


exists and the original signal can be obtained from its Fourier transform by 


+00 
x(t) = / X (Pei df. (2.3.2) 


We make the following observations concerning the Fourier transform: 


° X(f) is generally a complex function. Its magnitude |X (f)| and phase ZX (f) rep- 
resent the amplitude and phase of various frequency components in x(t). The func- 
tion X (f) is sometimes referred to as the spectrum? of the signal x(t). 


To denote that X(f) is the Fourier transform of x(t), we frequently employ the 
following notation: 


X(f) = F[x@)]. 


To denote that x(t) is the inverse Fourier transform of X (f), we use the following 
notation: 


x(t) = F"[X(f)I. 
Sometimes we use a shorthand for both relations: 


x(t) & X(f). 


If the variable in the Fourier transform is w rather than f, then we have 


+00 p 
X(@) = ih x(t)e J” dt 


[o 0] 
and 
1 ft° . 
x(t) = ~ | X (w)e!™ do. 
2n 


—00 


The Fourier-wansform and the inverse Fourier-transform relations can be written as 
X(f) 
nA 
+00 +00 : i 
x(t) = i | / x(t)e Jat a| e7ftaf 


—00 co 


+00 +o - 
= / | / eli ag) x) dr. (2.3.3) 


—00 co 


where in the last step we have exchanged the order of integration. On the other hand, 


+00 
x(t) = / o(t — t)x(t) dt. (2.3.4) 


—00 


5Sometimes X (f) is referred to as a voltage spectrum, as opposed to a power spectrum, which will be 
defined later. 
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Comparing Equation (2.3.3) with Equation (2.3.4), we obtain 


+00 
ôlt— Tt) = i eirf E-dqF (2.3.5) 
—00 
or 
+œ 
ôl) = i edf. (2.3.6) 
-00 


° Equation (2.3.1) is similar to Equation (2.2.3) and Equation (2.3.2) is similar to 
Equation (2.2.2). The difference is that the sum in Equation (2.2.2) is substituted 


by the integral. z 
Example 2.3.1 
Determine the Fourier transform of the signal TI (ż) given in Equation (2.1.15) and shown in 
Figure 2.33. 


Solution We have 


+00 
FT) = / T(t)e2*Stap 


00 


1 


+3 
= f e`if'dt 
1a 


2 
1 
—jonf 
_ sinnf 
EET 
= sinc( f). (2.3.7) 





[eins — ef] 








5 f  Figure2.33 TI(t) and its Fourier 
transform. 
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Therefore, 
F [TI (t)] = sinc( f). 
Figure 2.33 illustrates the Fourier-transform relationship for this signal. a 
Example 2.3.2 


Determine the Fourier transform of an impulse signal x(t) = ê (t). 


Solution The Fourier transform can be obtained by 
+00 
F[5(0)] = / leidt 
—0o 


=1 (2.3.8) 


where we have used the sifting property of 5(t). This shows that all frequencies are present in 
the spectrum of 6(¢) with unity magnitude and zero phase. The graphs of x(t) and its Fourier 
transform are given in Figure 2.34. Similarly, from the relation 


+00 . 
Í lfe adf = 1, 


o0 


we conclude that 


F] = 6(f). a 


Example 2.3.3 
Determine the Fourier transform of signal sgn(t). 


Solution We begin with the definition of sgn(t) as a limit of an exponential, as shown in 
Figure 2.18 and given by 


eri t>0 
Xn(t) = {| —er t<0. (2.3.9) 
í 0 t=0 


C) 


~t 


Figure 2.34 Impulse signal and its spectrum. 
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For this signal, the Fourier transform is 
Xa (f) m F [xn (t)] 


0 +00 
(-e*) eo P*Ftdt + f ene idt 
0 


—00 
0 +00 
Z -f eta -P"f) dt + Í et GtnN gt 
—0o 0 
S 1 a 1 
~ d jmf 1+ janf 
—j4 i 
e a (2.3.10) 
ne + 4n? f2 
Now, letting n — oo, we obtain 
F [sgn(t)] = lim Xa (f) 
-ján f 
— neo 1+4 f? 
EE (2.3.11) 
inf 
The graphs of sgn(t) and its spectrum are shown in Figure 2.35. a 


Fourier Transform of Real, Even, and Odd Signals. The Fourier-transform 
relation can be generally written as 


+00 


F ixt] = f x(the "dt 


—00 
+00 +00 
= J x(t)cos(2nft)dt — j / x(t) sin(2n ft) dt. 
—00 -00 
For real x(t), both integrals 
+00 
l x(t)cos(2x ft) dt 
“aG 
and 
+00 
/ x(t) sin(2n ft) dt 
—00 


are real; therefore, they denote the real and imaginary parts of X (f), respectively. Since 
cos is an even function and sin is an odd function, we know that for real x(t) the real part 
of X (f) is an even function of f and the imaginary part is an odd function of f. Therefore, 
in general, for real x(t), the transform X (f) is a Hermitian function 


X(-f) = X*(f). 
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sgn (t) į 





|F[sgn (II 


4 F[sgn (1)] 


Figure 2.35 The signum signal and its 
spectrum. 





This is equivalent to the following relations: 
Re[X (— f)] = Re[X(f)], 
Im[X (—f)] = —Im[X(f)], 
IX(-A)I=IX(AL 
LX(—f) = —-LX(f). 
Typical plots of |X (f)| and ZX (f) for a real x(t) are given in Figure 2.36. 


If, in addition to being real, x(t) is an even signal, then the integral 


+00 
i x(t) sin(2x ft) dt 
—00 

vanishes because the integrand is the product of even and odd signals; therefore, it is odd. 
Hence, the Fourier transform X (f) will be real and even. Similarly, if x(t) is real and odd, 
the real part of its Fourier transform vanishes and X (f) will be imaginary and odd. 
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IX(f)| 


L£X(f) 


Figure 2.36 Magnitude and phase of the spectrum 
of a real signal. 


Signal Bandwidth. The bandwidth of a signal represents the range of frequencies 
present in the signal. The higher the bandwidth, the larger the variations in the frequencies 
present. In general, we define the bandwidth of a real signal x(t) as the range of positive 
frequencies present in the signal. In order to find the bandwidth of x(t), we first find X (f), 
which is the Fourier transform of x(t); then, we find the range of positive frequencies that 
X(f) occupies. The bandwidth is BW = Wmax — Wmin, Where W max is the highest positive 
frequency present in X (f) and Wyin is the lowest positive frequency present in X (f). 


2.3.2 Basic Properties of the Fourier Transform 


In this section, we will develop the basic properties of the Fourier transform. With each 
property, we will present examples of its applications. 


Linearity. The Fourier-transform operation is linear. That is, if xı (t) and x2(t) 
are signals possessing Fourier transforms X(f) and X2(f), respectively, the Fourier tans- 
form of æxı (t) + Bx2(t) is aX,(f) + BX2(f), where œ and £ are two arbitrary (real or 
complex) scalars. This property is a direct result of the linearity of integration. 


Example 2.3.4 
Determine the Fourier transform of u_, (t), the unit-step signal. 


Solution Using the relation 


1 1 1 1 
u_\(t) = 3 + 7sen(t) = 5 x1+ 7sentt) 


and the linearity theorem, we obtain 


Flu_i(t))= F $ xl+ zo| 


1 1 
= 5(f) $ PTA (2.3.12) 
a 
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Duality. If 
X(f) = F[x()], 
then 
x(f) = F[X(—t)] 


and 


x(—f) = F[X()]. 
To show this property, we begin with the inverse Fourier-transform relation 
+00 , 
x(t) = i) X (fe? df. 

—00 
Then, we introduce the change of variable u = — f to obtain 

+00 ; 

x(t) = J X(—u)je P ™ du. 
—00 


Letting t = f, we have 
+00 : 
Wel X (—u)e ?™! du; 


o0 
finally, substituting ¢ for u, we get 


+00 
x(f)= J X (=t)? ™f dt 


or 
x(f) = F[X(-t)]. (2.3.13) 


Using the same technique once more, we obtain 


x(—f) = F[X(t)]. (2.3.14) 


Example 2.3.5 
Determine the Fourier transform of sinc(t). 


Solution Noting that I(t) is an even signal and, therefore, that II(— f) = TI( f), we can 
use the duality theorem to obtain 


F [sinc(t)] = M(—f) = Mf). (2.3.15) 
a 


Example 2.3.6 
Determine the Fourier transform of L é 


Solution Here again, we apply the duality theorem to the transform pair derived in Example 2.3.3 


F [sgn(t)] = a 


66 Signals and Linear Systems Chapter 2 


to obtain 


1 

a| =h = =se. 
jut 

Using the linearity theorem, we have 


s|] ey (2.3.16) 


Shift in Time Domain. A shift of fo in the time origin causes a phase shift of 
—21f to in the frequency domain. In other words, 


F [x(t — to)] =e 7" F0 F[x(t)]. 


To see this, we start with the Fourier transform of x(t — tọ), namely, 


CO 
F [x(t — t)] = / x(t — toe? "dt. 
—00 
With a change of variable of u = t — tọ, we obtain 
oo . . 
F [x(t —t)] = / x(uje i? e-i Igy 
—00 


[o0] 
sgam] x(u)je ~?" "du 
—00 


= eI o ¢[x(t)]. (2.3.17) 


Note that a change in the time origin does not change the magnitude of the transform. 
It only introduces a phase shift linearly proportional to the time shift (or delay). 


Example 2.3.7 
Determine the Fourier transform of the signal shown in Figure 2.37. 


xe) =m (1-3). 


By applying the shift theorem, we obtain 


Solution We have 


F x(t)] = eII} = eI sinc(f). (2.3.18) 
5 a 





Figure 2.37 Signal x(t). 


Section 2.3 Fourier Transform 5 67 


Example 2.3.8 
Determine the Fourier transform of the impulse train 
CO 
>. 8t —nTp). 
n=-0 


Solution Applying the shift theorem, we have 


F [S(t —nTo)] = e727" F[S(t)] = e71, 


Therefore, 
foe] 0 
F l > 8l a) = D etm fnTy_ 
A=—0O n=—00 
Using Equation (2.2.14) 
+00 1 +00 . 
= jant 
> ôt — no) = = D e To 
n=—00 n=—00 


and substituting f for t and EA for To, we obtain 


oo n foe} oo 
— —ļ = j2unfT> — — j2unfl 
Lols )=h De ee Tia b (2.3.19) 
n=—00 n=—00 n=—00 
or 
œo 1 œo n 
-j2nnfTo _ * s(f——). 3. 
die ii D (1 =) (2.3.20) 
n=—00 n=~—0O 


This relation yields, 


F | Tsa -am| z + >: 5( = z, (2.3.21) 


n=—00 n=- 


The case of To = 1 is particularly interesting. For this case, we have 


GF l Yo se -»| = }_ êl -n). (2.3.22) 


n=—00 n=—00 


That is, after substituting f for t, we find that the Fourier transform of re d(t — n) is 


na=—OO 


itself. : r 


Scaling. For any real a Æ 0, we have 


F [x(at)] = ty (£) À (2.3.23) 
la| \a 
To see this, we note that 
F (x(at)) = J x(atyeP™I ge (2.3.24) 
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and make the change in variable u = at. Then, 


I f¢” A 
F[x(at)] = zS x(u)je ~i? "adu 
a| J- 


_ty (2) l (2.3.25) 


lal 


where we have treated the casesa > 0 anda < 0 separately. 

Note that in the previous expression, if a > 1, then x(at) is a contracted form of 
x(t), whereas ifa < 1, x(at) is an expanded version of x(t). Thus, if we expand a signal 
in the time domain, its frequency-domain representation (Fourier transform) contracts; if 
we contract a signal in the time domain, its frequency domain representation expands. This 
is exactly what we expect since contracting a signal in the time domain makes the changes 
in the signal more abrupt, thus increasing its frequency content. 


Example 2.3.9 
Determine the Fourier transform of the signal 


3 0<t<4 
x(t) = i 
0 otherwise 


Solution Note that x(t) is a rectangular signal amplified by a factor of 3, expanded by a 
factor of 4, and then shifted to the right by 2. In other words, x(t) = 31 (=). Using the 
linearity, time shift, and scaling properties, we have 


E] 
sens Ça) 


= 12e“ ™ sinc(4 f). m 


Convolution. If the signals x(t) and y(t) both possess Fourier transforms, then 
F (x(t) x yA] = Fl[x(t)] - FIA = X(f)- YC). (2.3.26) 


For a proof, we have 


=00 —00 


co (oe) 
= / x(t) | J y(t — re ISE- ar| e Pri ar. 
= 


—00 


F (x(t) * y(t)] a | [7 roe sag) a| ef? ht at 
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Now with the change of variable u = t — t, we have 


oo À, oo À, 
/ y(t z te iS -ddt = J yue "du 
—00 —oo 


= ¥[y(t)] 
= Y(f); 


therefore, 


o0 


TRS l XOY (fle Pdr 


= X(f)-Y(f). (2.3.27) 


This theorem is very important and is a direct result of the fact that the complex 
exponentials are eigenfunctions of LTI systems (or, equivalently, eigenfunctions of the 
convolution operation). Finding the response of an LTI system to a given input is much 
easier in the frequency domain than it is the time domain. This theorem is the basis of the 
frequency-domain analysis of LTI systems. 


Example 2.3.10 
Determine the Fourier transform of the signal A (t), shown in Figure 2.15. 


Solution It is enough to note that A(t) = II(¢) » I(¢) and use the convolution theorem. 
Thus, we obtain 


FIA] = FA- FITC) = sinc? (f). (2.3.28) 


Example 2.3.11 


Determine the Fourier transform of the signal x(t) = TI (£) +A (4); which is shown in 
Figure 2.16 and discussed in Example 2.1.14. 


Solution Using scaling and linearity, we have 


X(f) = 4sinc(4f) + 2sinc?(2 f ). m 


_ Modulation. The Fourier transform of x(t)e/?"/0' is X(f — fo). To show this 
relation, we have 


o0 
F|x (r)e?* to] = J xei ot e-i nf gy 


—00 


loo} 
=i) xe IS -odt 


oe) 


= X(f — fo). (2.3.29) 


This theorem is the dual of the time-shift theorem. The time-shift theorem says that 
a shift in the time domain results in a multiplication by a complex exponential in the 


70 Signals and Linear Systems Chapter 2 


frequency domain. The modulation theorem states that a multiplication in the time domain 
by a complex exponential results in a shift in the frequency domain. A shift in the frequency 
domain is usually called modulation. 


Example 2.3.12 
Determine the Fourier transform of x(t) = e/2™', 


Solution Using the modulation theorem, we have 


F [eit] = F [leit hoy 


= 8(f — fo). 
Note that, since x (t) is not a real signal, its Fourier transform does not have Hermitian sym- 
metry. (The magnitude is not even and the phase is not odd.) | 


Example 2.3.13 
i Determine the Fourier transform of the signal cos(2x fot). 


Solution Using Euler’s relation, we have cos(2x fot) = eirf + leiado, Now using the 
linearity property and the result of Example 2.3.12, we have 


i 1 ; 
F [cos(27 fot)] = LSet] + 5 Fle Pri] 


E- fi) + ZEE + fi. : 


Nie 


Example 2.3.14 
Determine the Fourier transform of the signal 


x(t) cos(2nx fot). 


Solution We have 


F (x(t) cos(2n fot)] = F [sxe + Joem] 


1 1 
= z740 — fo) + XE + fo). (2.3.30) 
Figure 2.38 shows a graph of this relation. In Chapter 3, we will see that this relation is the 
basis of the operation of amplitude modulation systems. m 
Example 2.3.15 


Determine the Fourier transform of the signal 


cos(xt) |t| < 5 


x(t) = 
0 otherwise 


(2.3.31) 


shown in Figure 2.39. 
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i x(t) 
0.8 X(f) 
0.6 
0.4 l 
ob l ] f 
x(t) cos 2xfo t 





Figure 2.38 Effect of modulation in both the time and frequency domain. 


—0.5 —0.3 —0.1 0 0.1 03 0.5 t  Figure2.39 Signal x(t). 


$ 


Solution Note that x(t) can be expressed as 


x(t) = TI (t) cos(xt). ` (2.3.32) 
Therefore, 
F[MN(t) cos(xt)] = Like (+ — 5) + ae (+ + z) ; (2.3.33) 
2 2 2 2 
where we have used the result of Example 2.3.14, with fo = L. E 


Parsevaľs Relation. If the Fourier transforms of the signals x(t) and y(t) are 
denoted by X (f) and Y (f), respectively, then 


/ x(t)y*(t) dt = J X(f)Y*(f) df. (2.3.34) 


o0 —=00 


mea GU 
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To prove Parseval’s relation, we note that 


io x(t)y*(t) Apes is m xcneay If reperar | dt 
-p [irora] [freeman 


-Pro fron [Lena a 


Now using Equation (2.3.5), we have 


Í emf Hat =8(f — f’); 


—00 


therefore, 


j sorod = f xol PAS- Paf |p 


z J X(P¥*(faf, (2.3.35) 


where we have employed the sifting property of the impulse signal in the last step. 
Note that if we let y(t) = x(t), we obtain 


foe} foe} 
J |x(t)|?dt = / IX (fdf. (2.3.36) 
—00 —0o 
This is known as Rayleigh’s theorem and is similar to Parseval’s relation for periodic 
signals. 
Example 2.3.16 
Using Parseval’s theorem, determine the values of the integrals 
f sinc’ (t) dt 
—00 
and 
| sinc’ (t) dt. 
—00 


Solution We have F [sinc?(t)] = A(t). Therefore, using Rayleigh’s theorem, we get 


f sinta = f A’ (f)df 


0 1 
=f g+Daf+ | (f +1)*df 
=f 0 
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Figure 2.40 Product of I and A. 


For the second integral, we note that ¥ [sinc(t)] = T (f), therefore, by Parseval’s theorem, 
we have 


E sinc? (t)sinc(t) dt = S N(AACS) df. 


—0o — 


Figure 2.40 shows the product of TI (f) and A( f). From this figure, we see that 


[naar =i x l ya x L, 
ae ee ae OD: 
therefore, 
a 3 
J sinc’(t) dt = 7- C] 
—00 


Autocorrelation. The (time) autocorrelation function of the signal x(t) is denoted 
by R(t) and is defined by 


R,(t) = / x(t)x*(t — T) dt. (2.3.37) 


foe) 


The autocorrelation theorem states that f 
FIR: (T)] = IX (P)? (2.3.38) 


We note that R, (t) = x(t)*«x*(—T). By using the convolution theorem, the autocorrelation 
theorem follows easily.® 

From this theorem, we conclude that the Fourier transform of the autocorrelation of 
a signal is always a real-valued, positive function. 


Differentiation. The Fourier transform of the derivative of a signal can be obtained 
from the relation 


F [50] = j2nfX(f). (2.3.39) 


The definition of the autocorrelation function for random signals is given in Chapter 5. 


z 
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To see this, we have 


ad _@ s j2nft 
Eksi f X(f af 


—00 
9; . 
= | pampxcne™ ag, 
—0o 
Thus, we conclude that 


ae od 
oaa gO 
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(2.3.40) 


(2.3.41) 


or 
d : 
F | —x(t)| = j2nfX(f). 
dt 
With repeated application of the differentiation theorem, we obtain the relation 
g | Sx) | = GX) 
dt” ie 
Example 2.3.17 


Determine the Fourier transform of the signal shown in Figure 2.41. 


Solution Obviously, x(t) = £A (t). Therefore, by applying the differentiation theorem, we 


have 


d 
Fix] = F [40 
= janf FIACH] 
= j2nfsinc*(f). 


Differentiation in Frequency Domain. We begin with 


oa 
Fltx(t)] = ape 


Figure2.41 Signal x(t). 


(2.3.42) 


(2.3.43) 
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The proof is basically the same as the differentiation theorem in the time domain and is left 
as an exercise. Repeated use of this theorem yields 





j 
Flt?x(t)] = | — 2.3.44 
[7"x(t)] (4) 4 pk): (2.3.44) 
Example 2.3.18 
Determine the Fourier transform of x(t) = t. 


Solution We have 


LW (f). (2.3.45) 
a a 


Integration. The Fourier transform of the integral of a signal can be determined 


from the relation 


F [/ x(t) d| = tt +z 72 Of). (2.3.46) 


To show this, we use the result of Problem 2.15 to obtain 
t 
/ x(t)dt = x(t) *u_j(t). 
-00 


Now using the convolution theorem and the Fourier transform of u_;(t), we have 


t 1 
g il soar] = X(f) Las + z0] 


_ X(f) 
aaf * 5X (0s (f). (2.3.47) 


Example 2.3.19 
Determine the Fourier transform of the signal shown in Figure 2.42. 


Solution Note that S 
o= f I(r) dt. 
—oo 


Therefore, by using the integration theorem, we obtain 








F [x] = Dy + 5sine(0)3¢ f) 
_ sinc(f) | 1 
= -anf * 25h. (2.3.48) 
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t Figure 2.42 Signal x(t) in Example 
2.3.19. 


1 1 
2 2 


Moments. If #[x(t)] = X(f), then ee t"x(t) dt, the nth moment of x(t), can 
be obtained from the relation 








T t"x(t) dt = (4) a xP) ; (2.3.49) 
=o 2x) df" f=0 
This can be shown by using the differentiation in the frequency domain result. We have 
F[t’x(t)] = (4) a X(f). (2.3.50) 
2x) df” 


This means that 





He t"x(the "Ft gt = () ia X(f). 


Letting f = 0 on both sides, we obtain the desired result. 
For the special case of n = 0, we obtain this simple relation for finding the area 
under a signal, i.e., 


/ x(t) dt = X(0). (2.3.51) 


Example 2.3.20 
Let a > 0 and x(t) = e% u_ (t). Determine the nth moment of x(t). 


Solution First we solve for X (f). We have 
foe} 
X(f) = / et u_y (the dt 
= 


œ 
= | e™ etin ft) dt 
0 


1 


= —-——— (0-1 
a+ j2nf € ) 
1 
Zeo 2.3.52 
a+ j2nf ( ) 
By differentiating n times, we obtain 
d” — j2)" 
sak) = Pans ba A n ; (2.3.53) 
df" (a+ jan f)rt! 
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hence, 
Í * teu odr=(L) jm 
a n 27 ntl 
1 
= (2.3.54) 
P m 
Example 2.3.21 
Determine the Fourier transform of x(t) = e7*!"!, where œ > 0. (See Figure 2.43.) 
Solution We have 
x(t) =e™u_i(t) + e“u_1(—t) = x (t) + xı(~t), (2.3.55) 
and we have already seen that 
Flu] = Fieu] = ———_.. 
k] [e u(t] a+ jonf 
By using the scaling theorem with a = —1, we obtain 
Fixi (—t))] = F [e“u_, (—t)] = ———. 
RD = Flew = a 
Hence, by the linearity property, we have 
1 1 
@ feral] = a O 
ee Ga goah aa jon 
2a 
= 44m ft (2.3.56) 
= 
1 
0 t 
F [e7] 
1 


Figure 2.43 Signal e~*!"! and its 
0 f Fourier transform. 
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TABLE 2.1 TABLE OF FOURIER-TRANSFORM PAIRS 


Time Domain 


Frequency Domain 





<- 8(t) 1 
1 5(f) 
b(t — to) eifo 
ei?rhot êlf — fo) 
cos(2m fot) Ea- fo) + ZIG + fo) 
sin(27 fot) -500 + fo) + = — fo) 
n(t) sinc( f) 
sinc(t) (f) 
A(t) sinc? (f) 
sinc? (t) A(f) 
at 1 
e u(t), «>00 a+ jf 
ot ee as 
eC ees (@ + jonfy? 
= 2a 
š ee cee 
a? + (2x fY 
en ent? 
1 
sgn(t) TaT 
Laf 1 
u(t) 3 )+ jonf 
ô’ (t) j2nf 
6 (t) (j2nf)" 
L —jnsen(f) 
n=+00 1 n=+00 
D „ôC nT) T Da ô (F = =) 
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Table 2.1 provides a collection of frequently used Fourier-transform pairs. Table 2.2, 
outlines the main properties of the Fourier transform. 


2.3.3 Fourier Transform for Periodic Signals 

i] 
In this section, we extend our results to develop methods for finding the Fourier transform 
of periodic signals. We have already obtained the Fourier transform for some periodic sig- 
nals (see Table 2.1). These include e/?"/0', cos(2x fot), sin(21 fot), and Seat 5(t—nTp). 
The Fourier transform of all these periodic signals have a common property: the Fourier 
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TABLE 2.2 TABLE OF FOURIER-TRANSFORM PROPERTIES 


Signal ~ Fourier Transform 
ax (t) + Bx(t)) aX, (f) + BX2(f) 
X(t) x(-f) 
x(at) ty (4) 
lal \a 
x(t — to) eftnXx (f) 
e?e x(t) ' X(f- fo) 
x(t) x y(t) X(f)¥(f) 
x(t)y(t) X(f) *¥(f) 
d ; a: 
nee j2ufx(f) 
da" eee 
l Ir tt) (j2nf)"X (P) 
URS Pcs 
tx(t) 4 (ż) ape 
n (iy E 
t" x(t) l (ż) PTAA 
i X(f) 1 
f xdr a ON) 


transform consists of impulse functions in the frequency domain. In this section, we will 
show that this property holds for all periodic signals; in fact, there exists a close relationship 
between the Fourier transform of a periodic signal and the Fourier-series representation of 
the signal. 

Let x(t) be a periodic signal with the period To. Let {x,} denote the Fourier-series 
coefficients corresponding to this signal. Then 


cas a 
jan tt 
x(t) = J Xne TT", 


n=—0O - 


By taking the Fourier transform of both sides and using the fact that 


gerne] = (+ 2 z) | 


X= Y md (s = z). (2.3.57) 


n=—0O 


we obtain 


From this relation, we observe that the Fourier transform of a periodic signal x(t) consists 
of a sequence of impulses in frequency at multiples of the fundamental frequency of the 
periodic signal. The weights of the impulses are simply.the Fourier-series coefficients of 
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the periodic signal. This relation gives a shortcut for computing Hourier-series coefficients 
of a signal using the properties of the Fourier transform. If we define the truncated signal 
XN (t) as : 


xt) -B<r<% 


x7, (t) = 2.3.58 
ay) 0 otherwise i ( ) 
we see that 
[oe] 
x(t)= J. tnt- nT). (2.3.59) 
n=- - 
Noting that xn (t — nTo) = xr (t) x ô(t —nTo), we have 
Cc 
x(t) = xr (t) * D ô(t —nTp). (2.3.60) 
n=—00 


Therefore, using the convolution theorem and Table 2.1, we obtain 


X(f) = Xn(f) È » s(+-4)]. (2.3.61) 


n=—0o 


which simplifies to 


oo 


au n n 
X(f) ao So Xn ($) (s- z) (2.3.62) 


Comparing this result with 


X(f)= YO xô (7 = z). (2.3.63) 


n=—00 


we conclude that 


1 n 
n=—X =l. 2.3.64 
x, Ti To (5) ( ) 


This equation gives an alternative way to find the Fourier-series coefficients. 
Given the periodic signal x(t), we can find x, by using the following steps: 


1. First, we determine the truncated signal x7,(t) using Equation (2.3.58). 


2. Then, we determine the Fourier transform of the truncated signal using Table 2.1 and 
the Fourier-transform theorems and properties. 


3. Finally, we evaluate the Fourier transform of the truncated signal at f = T and scale 
it by T as shown in Equation (2.3.64). 
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Example 2.3.22 
Determine the Fourier-series coefficients of the signal x(t), as shown in Figure 2.25. 


Solution We follow the preceding steps. The truncated signal is 
t 
x(t) = (=) 
t 


Xn(f) = tsine(tf). 


and its Fourier transform is 


Therefore, 


2.3.4 Transmission over LTI Systems 


The convolution theorem is the basis for the analysis of LTI systems in the frequency 
domain. We have seen that the output of an LTI system is equal to the convolution of 
the input and the impulse response of the system. If we translate this relationship in the 
frequency domain using the convolution theorem, then X(f), H(f), and Y(f) are the 
Fourier transforms of the input, system impulse response, and the output, respectively. 
Thus, 


Y(f) =X(f)H(f). 


Hence, the input-output relation for an LTI system in the frequency domain is much sim- 
pler than the corresponding relation in the time domain. In the time domain, we have the 
convolution integral; however, in the frequency domain we have simple multiplication. 
This simplicity occurs because the signal’s Fourier transformation represents it in terms 
of the eigenfunctions of LTI systems. Therefore, to find the output of an LTI system for 
a given input, we must find the Fourier, transform of the input and the Fourier transform 
of the system impulse response. Then, we must multiply them to obtain the Fourier trans- 
form of the output. To get the time-domain representation of the output, we find the inverse 
Fourier transform of the result. In most cases, computing the inverse Fourier transform is 
not necessary. Usually, the frequency-domain representation of the output provides enough 
information about the output. f 


Example 2.3.23 
Let the input to an LTI system be the signal 


x(t) = sinc(W)t), 
and let the impulse response of the system be 
\ k h(t) = sinc(W2t ). 


Determine the output signal. 
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Figure2.44 Lowpass signal and lowpass 
filter. 





Solution First, we transform the signals to the frequency domain. Thus, we obtain 


= ho(g) 
and 
w= aa): 


Figure 2.44 shows X(f) and H(f). To obtain the output in the frequency domain, we have 


Y(f) = X(f)H(f) 


pee T J 
= ww” (#,)"(£) 


1 £ 
mw l! (4) W < W2 








= ; (2.3.65) 
wall (4) W, > Wp 
From this result, we obtain 
wsinc(W1t) W, < W 
yt) = m 


w sinc(Wot) Wi > W l 


Signals such as x(t) in the preceding example are called lowpass signals. Lowpass 
signals are signals with a frequency domain representation thatcontains frequencies around 
the zero frequency and does not contain frequencies beyond some Wj. Similarly, an LTI 
system that can pass all frequencies less than some W and rejects all frequencies beyond 
W is called an ideal lowpass filter (LPF). An ideal lowpass filter will have a frequency 
response that is 1 for all frequencies -W < f < W and is O outside this interval. W 
is the bandwidth of the filter. Similarly, we can have ideal highpass filters. In a highpass 
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Figure 2.45 Various filter types. 


filter (HPF), H ( f ), there is unity outside the interval -W < f < W and zéro inside. Ideal 
bandpass filters (BPF) have a frequency response that is unity in some interval W1 < |f| < 
W3 and zero otherwise. In this case, the bandwidth of the filter is W3 — W1. Figure 2.45 
shows the frequency-response functions of various filter types. 

For nonideal lowpass or bandpass filters, the bandwidth is usually defined as the 
bànd of frequencies at which the power-transfer ratio of the filter is at least half of the 
maximum power-transfer ratio. This bandwidth is usually called the 3 dB bandwidth of the 
filter, because reducing the power by a factor of 2 is equivalent to decreasing it by 3 dB on 
the logarithmic scale. Figure 2.46 shows the 3 dB bandwidth of filters. 

It is worth emphasizing that the bandwidth of a filter is always the set of positive 
frequencies that a filter can pass. 


Example 2.3.24 
The magnitude of the wansfer function of a filter is given by 


IH(f)| = 


Determine the filter type and its 3 dB bandwidth. 


Solution At f = 0, we have |H(f)| = 1 and |H(f)| decreases and tends to zero as f tends 
to infinity. Therefore, this is a lowpass filter. Since power is proportional to the square of the 
amplitude, the equation must be 
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3 dB bandwidth 


Figure 2.46 3 dB bandwidth of filters in Example 2.3.24. 


1 i 
IHP = —_——,, = 


f 
1+ (hss) 
This yields fọ = £10,000. Therefore, this is a lowpass filter with a 3 dB bandwidth of 10 kHz. 
A plot of |H (f)| is shown in Figure 2.47. a 


=f 
s Figure 2.47 3 dB bandwidth of 
x 10 filter in Example 2.3.24. 





Section 2.4 Filter Design : 85 
2.4 FILTER DESIGN 


In communication systems, filters are widely used to separate desired signals from unde- 
sired signals and interference. Usually, the desired signal characteristics are specified in 
the frequency domain, in terms of the desired magnitude and phase response of the filter. 
In the filter design process, we determine the coefficients of a causal filter that closely 
approximates the desired frequency response specifications. 

There are a variety of filter types, both analog and digital. We are particularly inter- 
ested in the design of digital filters, because they are easily implemented in software on a 
computer. Digital filters are generally classified as having either a finite duration impulse 
response (FIR) or an infinite duration impulse response (IIR). An FIR filter is an all-zero 
filter that is characterized in the z-domain by the system function 


M-1 
H) = $ hw, 
k=0 


where {h(k),0 < k < M — 1} is the impulse response of the filter. The frequency response 
of the filter is obtained by evaluating H(z) on the unit circle, i.e., by substituting z = e}? 
in H(z) to yield H (w). In the discrete-time domain, the FIR filter is characterized by the 
(difference) equation 


M-1 
y(n) = $ h(k)x(n —k), 


k=0 


where {x()} is the input sequence to the filter and y(n) is the output sequence. 
An IIR filter has both poles and zeros, and it is generally characterized in the z-domain 
by the rational system function ' 


Ma! b(k)z™ 
L= De a(k) 


where {a (k)} and {b(k)} arethe filter coefficients. The frequency response H (w) is obtained 
by evaluating H (z) on the unit circle. In the discrete-time domain, the IIR filter is charac- 
terized by the difference equation 


H(z) = 


N M-1 
y(n) =J alkyn- k) + Y bxh — k). 
k=1 k=0 


In practice, FIR filters are employed in filtering problems where there is a require- 
ment of a linear-phase characteristic within the passband of the filter. If there is no require- 
ment for a linear-phase characteristic, either an IIR or an FIR filter may be employed. In 
general, if some phase distortion is either tolerable or unimportant, an IIR filter is prefer- 
able, primarily because the implementation involves fewer coefficients and consequently 
has a lower computational complexity. In the communication systems that are considered 
in this text, phase distortion is very undesirable; hence, our focus will be on the design 
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Figure 2.48 Magnitude characteristics of physically realizable filters. 


of linear-phase FIR filters. A typical frequency-response specification for a causal digital 
filter is illustrated in Figure 2.48. 

The filter is a lowpass filter with the passband-edge frequency fp = œ p/27, stopband- 
edge frequency fs = ws/2n, passband ripple 5,, and stopband ripple ô2. The transition of 
the frequency response from passband to stopband defines the transition band of the filter. 
The width of the passband, 0 < f < fp, is usually called the bandwidth of the filter. 

Often, the graph of the frequency response of a filter has a large dynamic range, 
and to accommodate this range, it is common practice to use a logarithmic scale for the 
magnitude |H (f)|. Consequently, the ripple in the passband is 20 logy ce , and the ripple 
in the stopband is 20 log; 52. i 

In a filter design problem, we usually specify several filter parameters (1) the maxi- 
mum tolerable passband ripple, (2) the maximum tolerable stopband ripple, (3) the 
passband-edge frequency fp, and (4) the stopband-edge frequency fs. On the basis of 
these specifications, we select the filter coefficients that are closest to the desired frequency 
response specifications. 

MATLAB provides several functions for designing digital filters based on various 
design methods. Our focus will be on the design and implementation of digital FIR filters 
(lowpass filters, Hilbert-transform filters, and differentiator filters) based on the Remez 
algorithm, which provides optimum equiripple, linear-phase FIR filters. The Remez algo- 
rithm requires that we specify the length of the FIR filter, the critical frequencies fp and f, 
and the ratio 52/5,. However, it is more natural in filter design to specify fp, fs, 51, and 52 
and to determine the filter length that satisfies the specifications. Given the specifications, 
a simple formula for approximating the filter length M is 


ae —20 log, (5152) — 13 
~ 14.6Af 
where Af is the transition bandwidth Af = fs — fp. 


+1, (2.4.1) 
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Example 2.4.1 i 
Use the “remez” function to design an FIR-lowpass filter that meets the following specifica- 
tions: 
Passband ripple < 1 dB, 
Stopband attenuation > 40 dB, 
Passband-edge frequency = 0.2, 
Stopband-edge frequency = 0.35. 


1. Estimate the length of the FIR filter using Equation (2.4.1). 
2. Plot the magnitude frequency response 201log,, |H (f)|- 


Solution From the preceding data, the ripple in the passband is given by 201ogj) ry. This 


l= 
is equal to 1 dB, so we have 


1 +ô: 
1 = 201 —. 
0810 i ôi 
Solving for 5), we get 
6, = 0.0575. 


On the other hand, the attenuation in the stopband is measured with respect to the maximum 
passband gain, i.e., 1 + ô}. Therefore, we have 


3 
20 logig ET = —40. 


Substituting 5; = 0.0575 and solving for 52, we obtain a 
ô = 0.0106. 
We can also determine the value of A f to be 
Af = fs — fp = 0.35 — 0.2 = 0.15. 


Now that we know the values of 5), 62, and Af, we can use Equation (2.4.1) to find the length 
of the filter M: z; 


Therefore, we design a filter of length 10. 
The MATLAB command used here is the remez command, which has a syntax of the 
form 


h=remez (M, f, m,w) 


where M is the order of the filter (note that the order of the filter is related to the length of 
the filter by M through M = M — 1) and f is a vector of frequencies (the first component is 
equal to 0 and the last component is equal to 1). The components of f are in increasing order 
and specify the important frequencies, such as the end of the passband and the beginning of 
the stopband. Vector m is a vector whose size is equal to the size of vector f and contains 
the desired magnitude response of the filter corresponding to the values of f. Vector w is a 
weighting vector; in the passband, it is equal to 2, and in the stopband, it is equal to 1. In our 
example, we have 
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f=0 0.1 0.15 1] 
m=[1 1 0 0] 


0.0106 
0575 ~ 


1 = [0.1839 1). 


Using these results and calling 
>> [h]=remez (M, £,m,w) 


at the MATLAB prompt returns the impulse response of the desired filter. Once we have the 
impulse response, we can use the MATLAB freqz command to determine the frequency 
response of the filter. MATLAB’s complete set of commands for designing this filter are as 
follows: 


>> fp=0.2; 

>> fs=0.35; 

>> df=fs-fp; 

>> Rp=1; 

>> As=40; 

>> deltal=(10* (Rp/20)-1)/ (10% (Rp/20) +1); 

>> delta2=(1+deltal1) «(10% (-As/20)); 

>> Mhat=ceil ( (-20*1l0g10 (sqrt (deltal*delta2) ) -13)/ 
(14.6*d£)+1); 

>> f=[0 fp fs 1]; 

>> m=[1 1 0 0]; 

>> w=[delta2/deltal 1]; 

>> h=remez (Mhat+20,£,m,w); 

>> [H,W]=freqz(h,[1],3000); 

>> plot(W/pi,20*10g10 (abs (H) ) ) 


The transfer function of the resulting filter, in decibels, is plotted in Figure 2.49. a 





-120 Figure 2.49 Frequency response of 
0 0.2 0.4 0.6 0.8 1 the designed filter. 
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2.5 POWER AND ENERGY 


The concepts of power and energy, as well as power-type and energy-type signals, were 
defined in Section 2.1.2. In this section, we expand these concepts both in the time and 
frequency domains. 

The energy and power of a signal represent the energy or power delivered by the 
signal when it is interpreted as a voltage or current source feeding a 1 Q resistor. The 
energy content of a (generally complex-valued) signal x(t) is defined as 


E = J Ix(t)|?de, 
(o 0] 


and the power content of a signal is 


1 +3 
’ P, = lim — / |x(t)|7dt. 
T>œ T = 


A signal is energy-typeif €x < oo, and itis power-typeif0 < P, < oo. A signal cannot 
be both power- and energy-type because Py = 0 for energy-type signals, and y = oo for 
power-type signals. A signal can be neither energy-type nor power-type. An example of 
such a signal is given in Problem 2.10. However, most of the signals of interest are either 
energy-type or power-type. Practically, all periodic signals are power-type and have power 


1 atTo 5 
P, == t)|dt, 
ef ko 
where To is the period and « is any arbitrary real number. 


2.5.1 Energy-Type Signals 


For an energy-type signal x (t), we define the autocorrelation function 


Rx(t) = x(t) *x*(—T) 


= a x(t)x*(t — t)dt 


[o0] 


a f EONA (2.5.1) 


~90 
By setting t = O in the definition of the autocorrelation function of x(t), we obtain its 
energy content, i.e., r 


E= / j lx (t)|?dt 


(oe) 


= R,(0). (2.5.2) 


Using the autocorrelation property of the Fourier transform (see Section 2.3.2), we derive 
the Fourier transform of R,(t) to be |X (f)|*. Using this result, or employing Rayleigh’s 
theorem, we have 
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CO 
Ex = i Ix(t)|?de 
— 00 


= | IX(P) Pdf. (2.5.3) 


If we pass the signal x(t) through a filter with the (generally complex) impulse response 
h(t) and frequency response H (f), the output will be y(t) = x(t) * h(t) and in the fre- 
quency domain ¥ (f) = X(f) H(f). To find the energy content of the output signal y(t), 
we have 


$,= J Iyar 
Š / IY (f)Paf 


z / IX(APIA Raf 


= R,(0), (2.5.4) 


where R(t) = y(t) * y*(—T) is the autocorrelation function of the output. The inverse 
Fourier transform of |Y (f)|? is 


Ry(t) = F" [IY (PY) 
= F [IXP IHA] 
2g [IXP] F~ [AP] 
= R,(t) * R,(t), (2.5.5) 


where (a) follows from the convolution property and (b) follows from the autocorrelation 
property. Now let us assume that 


1 W<f<W+t+AW 
A(f)= I . 
0 otherwise 
Then 
IX(f)? W<f<W+AW 
Ees {XO : 
0 otherwise 
and 


by =| woprar 


x |X(W)/7AW. (2.5.6) 
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This filter passes the frequency components in a small interval around f = W, and rejects 
alltheother components. Therefore, the output energy represents the amount energy located 
in the vicinity of f = W in the input signal. This means that |X (W)|?AW is the amount 
of energy in x(t), which is located in the bandwidth [W, W + AW]. Thus, 


Energy in [W, W + AW] bandwidth 


2- 
IX(W)|° = AW 


This shows why |X(f)I* is called the energy spectral density of a signal x(t), and why 
it represents the amount of energy per unit bandwidth present in the signal at various fre- 
quencies. Hence, we define the energy spectral density (or energy spectrum) of the signal 
x(t) as 


Gf) =X? 
= ¥[R,(t)]. (2.5.7) 


To summarize, 


1. For any energy-type signal x(t), we define the autocorrelation function R,(t) = 
x(T) *x*(—T). 

2. The energy spectral density of x(t), denoted by %, (f), is the Fourier transform of 
R,(t). It is equal to |X (f)I?. 

3. The energy content of x(t), x, is the value of the autocorrelation function evalu- 
ated at tT = 0 or, equivalently, the integral of the energy spectral density over all 
frequencies, i.e., 


Ex = Rx (0) 
= J &(f) df. (2.58) 


4. If x(t) is passed through a filter with the impulse response h(t) and the output is 
denoted by y(t), we have 


r 


y(t) = x(t) * h(t) 
Ry(t) = Rx(T) * Ra(t) 


Gf) = GAGS) = IX APIA. 
Example 2.5.1 


Determine the autocorrelation function, energy spectral density, and energy content of the 
signal x(t) = eu; (t), œ > 0. 


Solution First we find the Fourier transform of x(t). From Table 2.1, we have 


X(f) = : 
~ at jlnf’ 

Hence, 

1 


z 2 
BN =O = sree 
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and 
1 
R,(t) = F! [ixe] = a e, 


To find the energy content, we can simply find the value of the autocorrelation function at 
zero: 


E= RO= 5. = 


Example 2.5.2 
If the signal in the preceding example is passed through a filter with impulse response h(t) = 
e~*'y_4(t), B > 0, B # a, determine the autocorrelation function, the power spectral density, 
and the energy content of the signal at the output. 


Solution The frequency response of the filter is 


1 


"O= BS iinF 


Therefore, 


YP = XP? 
1 
© QPF Am? f)(B? + ax? f?) 
o 1 1 1 | 
o P-a p B+ An? f? |’ 


Note that in the last step we used partial fraction expansion. From this result, and using 
Table 2.1, we obtain 


1 1 1 a 
Ry(t) — pza Ee ee] __ 2B. Bl | 


and 


Sy = R,(0) 
_ 1 
— 20B(a + B) 


2.5.2 Power-Type Signals 


For the class of power-type signals, a similar development is possible. In this case, we 
define the time-average autocorrelation function of the power-type signal x(t) as 


1 tt 
R,(t) = lim — x(t)x*(t — Tt) dt. (2.5.9) 
T>00 T == 
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Obviously, the power content of the signal can be obtained from 


1 tt 
P, = lim — f \x(t)| dt 
T>œ T SE 
2z 
(2.5.10) 


= R,(0). 
We define Sx(f), the power-spectral density or the power spectrum of the signal x(t), to 
be the Fourier transform of the time-average autocorrelation function: 
Sx(f) = F[Ry(7)]. (2.5.11) 
Subsequently, we will justify this definition. Now we can express the power content 
of the signal x(t) in terms of S,(f) by noting that R,(0) = SES Sx (f)e? df l=0 = 


(2.5.12) 


SE S&(f)df, ie., 
Py = R,(0) 


= f SD) df. 


If a power-type signal x(t) is passed through a filter with impulse response A(t), the 


output is es 
y(t) = i x(t)h(t —t)dt 

—=00 
and the time-average autocorrelation function for the output signal is 
4 
RO= Jim = | , y= oe 
Jar 


(oe) 


Substituting for y(t), we obtain 
+f œœ foe} 
[/ h(u)x(t — u) au| [/ h*(v)x*(t — t — v) dv 


R = lim — 
y(t) pai T 2 5 
By making a change of variables w = t — u and changing the order of integration, we 


obtain 
Ry(t) = / / h(u)h* (v) 
T +u 
[x (w)x* (u +w-—t—v) dw] dudv 


. 1 

x lim — 

Too T J_T_, 
2 


= ae Ry(t + v —u)h(ujh*(v) dudv 


[Rx(t + v) «h(t + v)]h* (v) dv 
(2.5.13) 


ay i 


£ R,(t) * h(t) *h*(—1), 
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where (a) uses the definition of Ry, given in Equation (2.5.9), and (b) and (c) use the 
definition of the convolution integral. Taking the Fourier transform of both sides of this 
equation, we obtain 


S (F) = Sx(f)H(f)H"*(f) 
= Sx PUHAN. (2.5.14) 


This relation between the input--output power-spectral densities is the same as the relation 
between the energy-spectral densities at the input and the output of a filter. Now we can 
use the same arguments used for the case of energy-spectral density; i.e., we can employ an 
ideal filter with a very small bandwidth, pass the signal through the filter, and interpret the 
power at the output of this filter as the power content of the input signal in the passband of 
the filter. Thus, we conclude that S,(f), as just defined, represents the amount of power at 
various frequencies. This justifies the definition of the power-spectral density as the Fourier 
transform of the time-average autocorrelation function. 

We have already seen that periodic signals are power-type signals. For periodic sig- 
nals, the time-average autocorrelation function and the power-spectral density simplify 
considerably. Let us assume that the signal x(t) is a periodic signal with the period To and 
has the Fourier-series coefficients {x,}. To find the time-average autocorrelation function, 
we have 


ee Oe ne eer 
RO= Jim = | x(t)x"(t —t)dt 


TZ 


im T d 
= lin gy ie x(t)x*(t — t) dt 
To 


= lim = ie = SietGewa 
1 pt? s 
a Í 7 x@xte sod (2.5.15) 


This relation gives the time-average autocorrelation function for a periodic signal. If we 
substitute the Fourier-series expansion of the periodic signal in this relation, we obtain 


foe] 
R,(t) = if? D 5 Xn XS, tT e o dt, (2.5.16) 


n=—00 m=—00 


Now, using the fact that 
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we obtain 


o0 
Re(t) = > enle, (2.5.17) 


n=—00 


From this relation, we see that the time-average autocorrelation function of a periodic 
signal is itself periodic; it has the same period as the original signal, and its Fourier- 
series coefficients are magnitude squares of the Fourier-series coefficients of the original 
signal. 

To determine the power-spectral density of a periodic signal, we can simply find the 
Fourier transform of R,(t). Since we are dealing with a periodic function, the Fourier 
transform consists of impulses in the frequency domain. We expect this result, because a 
periodic signal consists of a sum of sinusoidal (or exponential) signals; therefore, the power 
is concentrated at discrete frequencies (the harmonics). Thus, the power spectral density of 
a periodic signal is given by 


Se(f) = D> ixn (r - z, (2.5.18) 


n=—00 


To find the power content of a periodic signal, we must integrate this relation over the 
whole frequency spectrum. When we do this, we obtain 


œœ 
P, = D lxnl?. 


n=—00 


This is the same relation we obtained in Section 2.2.3. If this periodic signal passes through 
an LTI system with the frequency response H (f), the output will be periodic and the power 
spectral density of the output can be obtained by employing the relation between the power 
spectral densities of the input and the output of a filter. Thus, 


Sf) =1HCA IP Y baal? (4-3) 


*N=—00 


n s n 
H (=) ô i - =) (2.5.19) 


and the power content of the output signal is 


ioe) 
= al 


n=—00 





œo 


P,= >> enl 


n=—00 








2.6 HILBERT TRANSFORM AND ITS PROPERTIES 


In this section, we will introduce the Hilbert transform of a signal and explore some of 
its properties. The Hilbert transform is unlike many other transforms because it does not 
involve a change of domain. In contrast, Fourier, Laplace, and z-transforms start from 
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the time-domain representation of a signal and introduce the transform as an equivalent 
frequency-domain (or more precisely, transform-domain) representation of the signal. The 
resulting two signals are equivalent representations of the same signal in terms of two 
different arguments, time and frequency. Strictly speaking, the Hilbert transform is not 
a transform in this sense. First, the result of a Hilbert transform is not equivalent to the 
original signal, rather it is a completely different signal. Second, the Hilbert transform 
does not involve a domain change, i.e., the Hilbert transform of a signal x(t) is another 
signal denoted by X(t) in the same domain (i.e., time domain with the same argument t). 

The Hilbert transform of a signal x(t) is a signal X(t) whose frequency compo- 
nents lag the frequency components of x(t) by 90°. In other words, £(t) has exactly 
the same frequency components present in x(t) with the same amplitude—-except there 
is a 90° phase delay. For instance, the Hilbert transform of x(t) = Acos(2n fot + 8) is 
Acos(2n fot + 0 — 90°) = Asin(2n fot + 0). 

A delay of = atall frequencies means that e/?*/' will become e/?"f0'— 2 = — jes? fot 
and e~/2*fot will become e~/2@"fo'—2) = jei2" Jot. In other words, at positive frequencies, 
the spectrum of the signal is multiplied by —j; at negative frequencies, it is multiplied by 
+j. This is equivalent to saying that the spectrum (Fourier transform) of the signal is mul- 
tiplied by — jsgn( f ). In this section, we assume that x (t) is real and has no DC component, 
i.e., X(f)\ po = 0. 

Therefore, 





F [X(t)] = —jsgn(f)X (f). (2.6.1) 
Using Table 2.1, we have 
eda ate 
F([—jsgn(f)] = a 


Hence, 
© x(t) 





X(t) = 4, *xx(t) = F 
wt T 


Lek ee 


dt. (2.6.2) 


Thus, the operation of the Hilbert transform is equivalent to a convolution, i.e., fil- 
tering. 
Example 2.6.1 

Determine the Hilbert transform of the signal x(t) = 2sinc (2t). 


Solution We use the frequency-domain approach to solve this problem. Using the scaling 
property of the Fourier transform, we have 


Ftray)a2xinl(f\en(f)= 1 =el 
swo=2x3n(2)=n(4)=n(r+3)+n(s 5) 


In this expression, the first term contains all the negative frequencies and the second term 
contains all the positive frequencies. 

To obtain the frequency-domain representation of the Hilbert wansform of x(t), we use 
the relation F [x(t)] = —jsgn(f) F [x(t)], whichresults in 


en any TEE 
sie@=in(s+5) in(4 5): 
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Taking the inverse Fourier wansform, we. have 


E(t) = je /™sinc(t) — je’™ sinc(t) 


=-j (e™" — e J") sinc(t) 
= —j x 2j sin(mt)sinc(t) 
= 2sin(xt) sinc(t). E 


Obviously performing the Hilbert transform on a signal is equivalent to a 90° phase 
shift in all its frequency components. Therefore, the only change that the Hilbert transform 
performs on a signal is changing its phase. Most important, the amplitude of the frequency 
components of the signal, and, therefore the energy and power of the signal, do not change 
by performing the Hilbert-transform operation. On the other hand, since performing the 
Hilbert transform changes cosines into sines, it is not surprising that the Hilbert transform 
X(t) of a signal x(t) is orthogonal to x(t).’ Also, since the Hilbert transform introduces 
a 90° phase shift, carrying it out twice causes a 180° phase shift, which can cause a sign 
reversal of the original signal. These are some of the most important properties of the 
Hilbert transform. In all the following results, we assume that X(f) does not have any 
impulses at zero frequency. 


Evenness and Oddness. The Hilbert transform of an even signal is odd, and the 
Hilbert transform of an odd signal is even. 
Proof. If x(t) is even, then X(f) is a real and even function; therefore, 
—j sgn(f)X(f) is an imaginary and odd function. Hence, its inverse Fourier transform 
X(t) will be odd. If x(t) is odd, then X(f) is imaginary and odd; thus — j sgn(f)X (f) is 
real and even. Therefore, x(t) is even. 


Sign Reversal. Applying the Hilbert-transform operation to a signal twice causes 
a sign reversal of the signal, i.e., 


R(t) = —x(t). (2.6.3) 
Proof. Since . 3 
FIA] = [—jsgn(f PX (S), (2.6.4) 
it follows that NE a 
F (X(t) = —X(f), (2.6.5) 


where X(f) does not contain any impulses at the origin. 


Energy. The energy content of a signal is equal to the energy content of its Hilbert 
transform. 


Proof. Using Rayleigh’s theorem of the Fourier transform, we have 


Ex =f |x(@) Pde =f IX (Paf (2.6.6) 


7x(t) and y(t) are orthogonal if f$ x(t)y*(t) dt = 0. 
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and 


ie / Pat = ii | —jsen(f)X (FPA. (2.6.7) 


o0 


Using the fact that | — jsgn(f)|? = 1 except for f = 0, and the fact that X (f) does not 
contain any impulses at the origin completes the proof. 
Orthogonality. The signal x(t) and its Hilbert transform are orthogonal. 


Proof. Using Parseval’s theorem of the Fourier transform, we obtain 


J xs) dr = f X (fy jsen(f)X (PV af 


o0 


0 +00 
-e J IX(P)Paf + j Í IX (Paf 


—00 


=0, (2.6.8) 


where, in the last step, we have used the fact that X (f) is Hermitian; therefore, |X (f)|* 
is even. 


2.7 LOWPASS AND BANDPASS SIGNALS 


A lowpass signal is a signal in which the spectrum (frequency content) of the signal is 
located around the zero frequency. A bandpass signal is a signal with a spectrum far from 
the zero frequency. The frequency spectrum of a bandpass signal is usually located around 
a frequency f+, which is much higher than the bandwidth of the signal (recall that the 
bandwidth of a signal is the set of the range of all positive frequencies present in the signal). 
Therefore, the bandwidth of the bandpass signal is usually much less than the frequency 
Jc, which is close to the location of the frequency content. 

The extreme case of a bandpass signal is a single frequency signal whose frequency 
is equal to f,. The bandwidth of this signal is zero, and generally, it can be written as 


x(t) = Acos(2nf,t + 0). 
This is a sinusoidal signal that can be represented by a phasor 
x= Ae”. 
Notethat A is assumed to be positive and the range of 0 is —x to x, as shown in Figure 2.50. 


Im 


2afo Ael? 


Re Figure 2.50 Phasor corresponding to a sinusoid signal. 
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The phasor has a magnitude of A and a phase of 0. If this phasor rotates coun- 
terclockwise with an angular velocity of wœ; = 2v f, (equivalent to multiplying it by 
ei?n fet), the result would be Ae/@*/c'+9) Tts projection on the real axis (its real part) is 
x(t) = Acos(2n fet + 0). 

We can expand the signal x(t) as 


x(t) = A cos(2x fet + 0) (2.7.1) 
= A cos(0) cos(2x fet) — A sin(0) sin(2x fot) (2.7.2) 
= x, COS(2N fct) — xs sin(2n fot). (2.7.3) 


From our previous discussion, we can see that this single-frequency signal has two compo- 
nents. The first component is x, = Acos(@), which is in the direction of cos(21 fst). This 
is called the in-phase component. The other component is x, = A sin(9), which is in the 
direction of — sin(2x fet). This is called the quadrature component. Note that we can also 
write 

xy = Ae? = x, + jxs. (2.7.4) 


Now assume that instead of the phasor shown in Figure 2.50, we have a phasor with 
slowly varying magnitude and phase. This is represented by 


x(t) = Ate, (2.7.5) 


where A(t) and @(t) are slowly varying signals (compared to fe). In this case, similar to 
Equation (2.7.3), we have 


x(t) = Re [A (thei Ore t9O)) (2.7.6) 
= A(t) cos(@(t)) cos(2n fet) — A sin(6(t)) sin(2n fet) (2.7.7) 
= x(t) cos(2n fet) — xs (t) sinQu fct). (2.7.8) 


Unlike the single frequency signal previously studied, this signal contains a range of fre- 

quencies; therefore, its bandwidth is not zero. However, since the amplitude (also called the 

envelope) and the phase are slowly varying, this signal’s frequency components constitute 

a small bandwidth around f,. The spectrum of the bandpass signal is shown in Figure 2.51. 
In this case, the in-phase and quadrature components are 


x(t) = A(t) cos(O(t)) (2.7.9) 
xs(t) = A(t) sin(O(t)) (2.7.10) 
X(f) 
(\ 
-f-W-f. fe + W -W fe fo+W f 


Figure 2.51 Spectrum of the bandpass signal. 
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and we have 
x(t) = x,(t) cos(2n fet) — xs (t) sinQ2n fet). (2.7.11) 


Note that both the in-phase and quadrature components of a bandpass signal are slowly 
varying signals; therefore, they are both lowpass signals. 

Equation (2.7.11) is a very useful relation; it basically says that a bandpass signal 
can be represented in terms of two lowpass signals, namely, its in-phase and quadrature 
components. 

In this case, the complex lowpass signal 


x(t) = Xe(t) + jxs(t) (2.7.12) 


is called the lowpass equivalent of the bandpass signal x(t). If we represent x;(t) in polar 


coordinates, we have ous 
. Xs(t 
x(t) = yf x2(t) + x2 (te? Few, 


Now if we define the envelope and the phase of the bandpass signal as 


l(t) = A(t) = yx) + x(t) 





s(t 2.7.13 
Zx,(t) = 6(t) = arctan Xs a ( ) 
Xe(t) 
we can express x(t) as 
xlt) = AH. (2.7.14) 
Using Equations (2.7.14) and (2.7.11), we have 
x(t) = Re [xte] (2.7.15) 
= Re [Apel tO) (2.7.16) 
= A(t) cos(2nf,t + 0(t)). (2.7.17) 


Equations (2.7.17) and (2.7.11) represent two methods for expressing a bandpass 
signal in terms of two lowpass signals. We can express the signal in terms of the in-phase 
and quadrature components or in terms of the envelope and phase of the bandpass signal. 


2.8 SUMMARY AND FURTHER READING 


This chapter provided a review of basic representation and classification of signals and the 
analysis of linear systems. Our weatment included the Fourier series representation of peri- 
odic signals and the response of linear time-invariant (LTI) systems to periodic signals. Our 
treatment also included the Fourier transform, its properties, and its use in the frequency 
domain characterization of signals and LTI systems. Additional topics treated were the 
representation of lowpass and bandpass signals, power-type and energy-type signals, the 
design of digital filters using MATLAB, and the Hilbert transform and its properties. The 
Hilbert transform plays an important role in the modulation and demodulation of analog 
and digital signals. 
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Numerous references cover the analysis of LTI systems in both the time and fre- 
quency domains. The book by Oppenheim, Willsky, and Young (1983) contains wide cov- 
erage of time- and frequency-domain analysis of both discrete-time and continuous-time 
systems. Papoulis (1962) and Bracewell (1965) provide in-depth analysis of the Fourier 
series and transform techniques. The book by Ingle and Proakis (2012) covers design tech- 
niques for digital filters using MATLAB. 


PROBLEMS 


2.1 Plot the following signals: 


1. x, (t) = I (2t + 5) 

2. x(t) = X o A(t — n) 
3. x3(t) = sgn(2t) — sgn(t) 
4. x4(t) = sinc(10t) 


2.2 Plot the discrete version of the given signals. Assume they are sampled at multiples 
of To, i.e., x[n] = x(n To). 


1. x(t) = sinc(3t) and To = 


Ole 





2t-1 1 
2. x(t) =] and Tọ = 8 


3. x(t) = tu (t) — (t — l)u-ı(t — 1) and Tp = ; 


2.3 Two signals, x;(t) = 1 and x2(t) = cos 2xt, are sampled att = 0 1, +2,....The 
resulting discrete-time signals are denoted by xı[n] and x[n]. Verify that x;[n] = 
x2[n]. What can you conclude from this observation? 


2.4 Show that the sum of two discrete periodic signals is periodic, whereas the sum of 
two continuous periodic signals is not necessarily periodic. Under what condition is 
the sum of two continuous periodic signals periodic? 


2.5 Determine whether the given signals are periodic. For periodic signals, determine the 
period. 


1. sin(4000xż) + cos(11,000z1) 
2. sin(4000xt) + cos(11,0002) 
3. sin(40001mm) + cos(11,00017) 
4. sin(40001n) + cos(11,0007) 


2.6 Classify the signals that follow into even signals, odd signals, and signals that are 
neither even nor odd. In the latter case, find the even and odd parts of the signals. 
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e“ t>0 
1. x(t) = -€e t¢<0 
0 t=0 


2. x(t) = cos (12071 + =} 


3. x3(t) = e™"! 
t t>0 
4. x4(t) = a 
an ' <0 
5. x5(t) = xı (t) — x2(t), where x; (t) is even and x2(t) is odd 
2.7 Classify these signals into energy-type signals, power-type signals, and signals that 
are neither energy-type nor power-type signals. For energy-type and power-type sig- 
nals, find the energy or the power content of the signal. 
» X1(t) = (e™ cost) u1 (t) 
e X2(t) = e™ cost 
. x3(t) = sgn(t) 
. x4(t) = Acos2n fit + Bcos2n fot 


Aa U N “e 


2.8 Classify these signals into periodic and nonperiodic: 


1L 4 (3) =2 E oA (=) — reco A(t — 4n) 


2. x2(t) = sint + sin 27t 


3. x3[n] = sinn 
2.9 Using the definition of power-type and energy-type signals, 


1. Show that x(t) = Ae/@*fo'+®) is a power-type signal and its power content 
is A?. 
2 
2. Show that x(t) = A cos(27 fot + 0) is power type and its power is D 


3. Show that the unit-step signal u_;(t) is a power-type signal and find its power 
content. 


4. Show that the signal 


1 
Kii t>0 
PT? eee 
0 +1<0 


is neither an energy-type nor a power-type signal. 


2.10 Find the even and odd parts of the signal x(t) = A (t)u—ı (t). 
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2.11 Using the definition of even and odd signals, 


1. 
2. 


Show that the decomposition of a signal into even and odd parts is unique. 


Show that the product of two even or two odd signals is even, whereas the prod- 
uct of an even and an odd signal is odd. 


. Try to find an example of two signals that are neither even nor odd, but have an 


even product. 


2.12 Plot the following signals: 


a kw N = 


- x(t) = TI) + N(-2) 

» X2(t) = AMIN) 

e x306) = re _ A(t — 2n) 
. x4(t) = sgn(t) + sgn] — t) 
. x5(t) = sinc(t) sgn(t) 


2.13 By using the properties of the impulse function, find the values of these expressions: 


1. 
2. 
3. 
4. 
5. 
6. 


7. 
8. 


xı (t) = sinc(t) (t) 

x(t) = sinc(t) 6(¢ — 3) 

x3(t) = A(t) x DoS ô(t — 2n) 
x4 (t) = A(t) x 6’(t) 

xst) = cos (2¢ + 2) 860) 
x6(t) = (5t) x ô (4t) 

SS sinc(t) 8(t) dt 

SS sinc(t + 1) 6(t) dt 


2.14 Show that the impulse signal is even. What can you say about evenness or oddness 
of its nth derivative? 


2.15 Wehave seen that x(t) x 6(t) = x(t). Show that 


and 


x(t) *d™(t) = Zaw 


t 
x(t) dt. 
CO 


x(t)x u(t) = / 


2.16 Classify these systems into linear and nonlinear: 


1. 
2. 


y(t) = 2x(t) —3 
yt) = lx) 
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3. y(t) =0 

4. y(t) = 2° 
so= {soe 
6. y(t) =e x(t) 

7. y(t) = x(t)u- (t) 

8. y(t) =x) + yt — 1) 
9 


. y(t) = Algebraic sum of jumps in x(t) in the interval [—oo, t] 


2.17 Prove that a system is linear if and only if 


2.18 


2.19 


2.20 


2.21 


2.22 


1. It is homogeneous, i.e., for all input signals x (t) and all real numbers a, we have 
T [ax(t)] = aF [x(t)]. 


2. It is additive, i.e., for all input signals x, (t) and x2(t), we have 


F(x (t) +O) = FIO) + Fla]. 


In other words, show that the two definitions of linear systems given by Equations 
(2.1.39) and (2.1.40) are equivalent. 


Verify whether any (or both) of the conditions described in Problem 2.17 are satisfied 
by the systems given in Problem 2.16. 


Prove thatif a system satisfies the additivity property described in Problem 2.17, then 
it is homogeneous for all rational a. 





Show that the system described by 
Ct), 
x(t) #0 
y(t) = 4 x'@) 
0 x(t) =0 


is homogeneous but nonlinear. Can you give another example of such a system? 


Show that the response of a linear system to the input which is identically zero is an 
output which is identically zero. 


The system defined by the input-output relation 


y(t) = x(t) cos(2x fot), 


where fo is a constant, is called a modulator. Is this system linear? Is it time 
invariant? 
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2.23 Are these statements true or false? Why? 


1. A system whose components are nonlinear is necessarily nonlinear. 


2. A system whose components are time variant is necessarily a time-variant 
system. 


3. The response of a causal system to a causal signal is itself causal. 
2.24 Determine whether the following systems are time variant or time invariant: 


- y(t) = 2x(t) +3 

~ y(t) = (t + 2)x(t) 

- y(t) = x(—t) 

- y(t) = x(t)u_i(t) 

. yt) = f x(t) dt 

- yt) = x(t) + yt- 1) 


An bk U NY = 


2.25 Prove that if the response of an LTI system to x(t) is y(t), then the response of this 
system to £x(t) is £ y(t). 


2.26 Prove that if the response of an LTI system to x(t) is y(t), then the response of this 
t : t 
system to f œ X(T) dt is f_ œ y(t) dt. 


2.27 The response of an LTI system to the input x(t) = e~™u_, (t) is ô(t). Using time- 
domain analysis and the result of Problem 2.25, determine the impulse response of 
this system. What is the response of the system to a general input x(t)? 


2.28 Let a system be defined by 


t+T 


1 
= — at. 
y(t) = IT |, x(t) dt 


Is this system causal? 


2.29 For an LTI system to be causal, it is required that h(t) be zero for t < 0. Give an 
example of a nonlinear system which is causal, but its impulse response is nonzero 
fort < 0. 


2.30 Determine whether the impulse response of these LTI systems is causal: 


1. A(t) = sinc(t) 


t—3 
2. h(t) =T1 =) 


2.31 Using the convolution integral, show that the response of an LTI system to u—; (t) is 
given by f“ A(t) dt. 
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2.32 What is the impulse response of a differentiator? Find the output of this system to an 


arbitrary input x (t) by finding the convolution of the input and the impulse response. 
Repeat for the delay system. 


2.33 The system defined by 
t 
y(t) = i x(t) dt. 
t-T 


(T is a constant) is a finite-time integrator. Is this system LTI? Find the impulse 
response of this system. 


2.34 Compute the following convolution integrals: 


1. e'u_(t)xe™u_(t) 
2. I(t) * A(t) 


"2.35 Show that in a causal LTI system, the convolution integral reduces to 


+00 t 
y(t) = i x(t —t)h(t) dt = | x(t)h(t — t) dt. 
0 o0 


in 
2.36 Show that the set of signals Y, (t) = Ke 275" constitutes an orthonormal set of 
signals on the interval [œ, a + To], where a is arbitrary. 


2.37 In this problem, we present the proof of the Cauchy—Schwartz inequality. 


1. Show that for nonnegative {a;}?_, and {B;}?_), 


n n 5 n 5 
Rew =| Ee] [Ea 
i= i=1 i= 


What are the conditions for equality? 


2. Let {x;}7_, and {y;}7_, be complex numbers. Show that 


n n n 
S aye} ast = DS ally? 
i=l i=1 i=1 


What are the conditions for equality? 
3. From (1) and (2), conclude that 


n n ; n 3 
yxy} aba [Zwe] l 


What are the conditions for equality? 
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4. Generalize the preceding results to integrals and prove the Cauchy—Schwartz 


inequality 
1 aS 1 
< | J koar |" l / oP at). 


What are the conditions for equality? 





fi x(t)y*(t) dt 


2.38 Let {¢; (OJP be an orthogonal set of N signals, i.e., 


[aooaa], TI Gergen, 
0 0 i=j 


and let x (t) be an arbitrary signal. Let è (t) = Dri a;@; (t) be alinear approximation 
of x(t) in terms of {¢ġ; (ODHA ,- Find @;’s such that 


e= I xE) — 20) dt 


is minimized. 
1. Show that the minimizing &;i’s satisfy 


Qi = F x(t); (t) dt. 


2. Show that with the preceding choice of @;’s, we have 


oo N 
dy = f AP dt — ) lal. 


=o i=1 
2.39 Determine the Fourier-series expansion of the following signals: 


1. x(t) = cos(2nt) + cos(4nt) > 

. x(t) = cos(2nt) — cos(4nt + 1/3) 

. x(t) = 2cos(2nt) — sin(4nt) 

. x(t) = A(t 2n) 

~ X(t) = reg A(t —n)u_i(t —n) 

- x(t) = |cos 27 fot | (full-wave rectifier output) 


An db WN 


2.40 Show that for a real, periodic x(t), the even and odd parts are given by 


foe} 
a n 
x(t) = = + J An COS (2x21) ; 


n=l 


© 
. n 
s Xo(t) = D an Sin (2x21) ; 


n=1 
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2.41 Let x, and yn represent the Fourier-series coefficients of x(t) and y(t), respectively. 
Assuming the period of x(t) is Tọ, express y, in terms of x, in each of the following 
cases: 

1. y(t) = x(t — to) 
2. y(t) = x(t)es?™ Jor 
3. y(t) =x(at),a #0 


2.42 Let x(t) and y(t) be two periodic signals with the period Tọ, and let x, and y, denote 
the Fourier-series coefficients of these two signals. Show that 


1 a+To oo 
A / x(t)y*(t)dt = X xry% 
0 Ja 


n=—oo 


2.43 Determine the Fourier-series expansion of each of the periodic signals shown in 
Figure P-2.43. For each signal, also determine the trigonometric Fourier series. 


x(t) x(d) 





Figure P-2.43 
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2.44 Determine the output of each LTI system when the signals in Problem 2.43 are the 
inputs: 


= f 
1. H(f) = 10n (4) 


2: 
—j O<f<4 
A(f)={ j -4<f <0 


0 otherwise 


2.45 Show that for all periodic physical signals that have finite power, the coefficients of 
the Fourier-series expansion x, tend to zero as n —> +00. 


2.46 Determine the Fourier transform of each of the following signals: 


1 
. xt) = Gz 


. A(t — 3) +N¢ +3) 
. 411 (£) cos(2x fot) 

. t sinct 

5. tcos 2r fot 


A U Nme 


2.47 The Fourier transform of a signal is shown in Figure P-2.47. Determine and 
sketch the Fourier transform of the signal xı (t) = —x(t) + x(t) cos(2000xżt) + 
2x(t) cos? (30007ż). 


1X0) 





Figure P-2.47 


2.48 Show that the Fourier transform of £8 (t + $) + Łô (t — 4) is cos(x f). Prove the 


following transform pairs: 
i 1 1 1 1 
FẸ = -ô = —6 peoa 
F [cos(7xt)] 5 (4 + 5) + 7 (1 z) 


and 


l l i 
F(sin(nt)] = ia (s+ 5) = Ís (1 = 3) } 
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2.49 Determine the Fourier transform of the signals shown in Figure P-2.49. 


x(t) x(t) 





—1 f 
(c) (d) 
x(t) x(t) 
1 sin (2x fot) 
1 
-2 -1 1 2 1 l 1 
(e) 2fo (£) 2fo Figure P-2.49 


2.50 Use the convolution theorem to show that 
sinc(t) x sinc(t) = sinc(t). 


2.51 Prove that convolution in the frequency domain is equivalent to multiplication in the 
time domain. That is, 


F[x(t)y(t)] = X(f) * Y(f). 
2.52 Let x(t) be an arbitrary signal and define xı (t) = peices x(t —nTp). 
1. Show that x;(t) is a periodic signal. 
2. How can you write xı (t) in terms of x(t) and Do arak ôlt-nD)? 


3. Find the Fourier transform of xı (t) in terms of the Fourier transform of x(t). 


2.53 Using the properties of the Fourier transform, evaluate the following integrals (œ is 
positive): 
co. 5 
1. f2 sinc? (t) dt 
2. foe sinc(t) dt 
3. jee e~™ cos( Br) dt 
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2.54 An LTI system with impulse response A(t) = e~*u_1(t) is driven by the input 
x(t) = e-*tu_\(t). Assume that a, 8 > 0. Using frequency-domain analysis, find 
the output of the system. Is the output power type or energy type? Find its power or 
energy. 


2.55 Let x(t) be periodic with period Tọ, and letO < œ < Tp. Define 


(1) x(t) a<t<a+T 
x = 
5 0 otherwise 


and let X,(f) denote the Fourier transform of x,(t). Prove that for all n, XalZ) is 
independent of the choice of a. 


2.56 Using the identity 


[o.e} 


5 §¢ nl) = > > PT, 


n=—00 S n=-00 


show that for any signal x(t) and any T;, the following identity holds 
= Lie n 2Qnt 
_ A 2 ier 
XY 26 nls) = 5 Dfi Ts 
n=—00 n=—00 
From this, derive the following relation known as Poisson’s sum formula: 


foe) 1 co 
Donez D xli) 


n=—00 n=—0O 
2.57 Using the Poisson’s sum formula discussed in Problem 2.56, show that 


ae 
foe) 2a _ ro —a|n| T 
1. peace a2+4n2n2 Lun=—0o E 


2. Jre- Sine(S) = K forall K e {1,2,...} 
3. o_o sine’*(4) = K  forallK e (1,2,...}. 
2.58 The response of an LTI system to e~“u_j(t), where (œ > 0), is 5(t). Using 


frequency-domain analysis techniques, find the response of the system to x(t) = 
e7% cos(Bt)u_(t). 


2.59 Find the output of an LTI system with impulse response h(t) when driven by the 
input x (t) in each of the following cases: 


1. A(t) = sinc(t) x(t) = sinc(t) 


2. A(t) = (t) + 6(t) x(t) =e, (æ >0) 


3. h(t) =e u_(t) x(t) =e"Fu_i(t), (a, B > 0) 
(treat the special case a = B separately) 
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2.60 Can the response of an LTI system to x(t) = sinc(t) be y(t) = sinc? (t)? Justify your 
answer. 


2.61 Let the response of an LTI system to TI (t) be A(t). 


1. Can you find the response of this system to x(t) = cos 2mt from the preceding 
information? i 


2. Show that h(t) = II(t) and A2(t) = TI(t) + cos 2m¢ can both be impulse 
responses of this system, and therefore having the response of a system to IT(¢) 
does not uniquely determine the system. 


3. Does the response of an LTI system to u—; (t) uniquely determine the system? 
Does the response to e™%u_; (t) for some œ > 0? In general, what conditions 
must the input x(t) satisfy so that the system can be uniquely determined by 
knowing its corresponding output? 


. 2.62 Show that the Hilbert transform of A sin(2n fot + 0) is —A cos(2n fot + 8). 
2.63 Show that the Hilbert transform of the signal e/?*40' is equal to — jsgn( fo)e Aot 
2.64 Show that eee. 

a[o] = 21| FIF x(O). 


2.65 Show that the Hilbert transform of the derivative of a signal is equal to the derivative 
of its Hilbert transform. 


2.66 The real narrowband signal x(t), whose frequency components are in the neighbor- 
hood of some fọ (and — fo), is passed through a filter with the transfer function 
H(f), and the output is denoted by y(t). The magnitude of the transfer function is 
denoted by A( f), and its phase is denoted by 6(/). Assume that the transfer function 
of the filter is so smooth that in the bandwidth of the input signal, the magnitude of 
the transfer function is essentially constant and its phase can be approximated by its 
first-order Taylor-series expansion, i.e., 


A(f) © Alfo) 
OCP) © O( fo) + F = fo)" (Ai pas 


1. Show that Y,(/), the Fourier transform of the lowpass equivalent of the output, 
can be written as 


fe ji 
¥i(f) © XNA, (fo)+f pas). 
2. Conclude that 


y(t) © A(fo) + Vilt — te) cos(27 fot — tp), 
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where V, (t) is the envelope of the input x(t) and 


nL ve” 

Ba 2x df = 
=fo 

font he) 

P m f P 


3. The quantities t, and t, are called envelope delay (or group delay) and phase 
delay, respectively. Can you interpret their role and justify this nomenclature? 


2.67 We have seen that the Hilbert transform introduces a 90° phase shift in the compo- 
nents of a signal and the transfer function of a quadrature filter can be written as 


e-i? f>0 
H(f) = {0 f=0. 
ei? f <0 


We can generalize this concept to a new transform that introduces a phase shift of 0 
in the frequency components of a signal, by introducing 


ei? f>0 
He(f) = 40 f =0 
e? Ff <0 


and denoting the result of this transform by xø (t), i.e., Xo (f) = X(f)Hoe(f), where 
Xo(f) denotes the Fourier transform of xe (t). Throughout this problem, assume that 
the signal x(t) does not contain any DC components. 


1. Find hg (t), the impulse response of the filter representing this transform. 
2. Show that x@(t) is a linear combination of x(t) and its Hilbert transform. 


3. Show that if x(t) is an energy-type signal, x(t) will also be an energy-type 
signal and its energy content will be equal to the energy content of x(t). 


COMPUTER PROBLEMS 


2.1 Fourier Series 


The purpose of this problem is to evaluate and plot the Fourier-series coefficients of 
a periodic signal. The periodic signal x(t), with period Tọ, is defined as 


t A, t|<t 
x) = amt) = rl < A 
2to 0 otherwise 


for |t| < To/2, where to < To/2. A plot of x(t) is shown in Figure CP-2.1. Let 
A=1, h = 4, and t = 1. 
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Figure CP-2.1 Signal x(t). 
1. Demonstrate mathematically that the Fourier-series coefficients in the expansion 
of x(t) are given as 


1, 
Xn = ~sinc 


: (5) = sin(nn/2) 


nn 
2. Use MATLAB to plot the original signal x(t) and the Fourier-series approxima- 


tion of x(t) over one period for n = 1, 3, 5, 7, and 9. Note that as n increases, 
the approximation becomes closer to the original signal x (t). 


3. Plot the discrete magnitude spectrum |x,| and the phase spectrum /x, for 
In| < 20. 
2.2 Filtering of a Periodic Signal 


The objective of this problem is to demonstrate the effect of passing a periodic signal 
through an LTI system. The periodic signal x(t) is a triangular pulse train with period 
To = 2 and defined over a period as 
t+1 -1<t<0 
A@) = f-t+1 0<1t<1 
0 otherwise 
The frequency response characteristic of the LTI filter is 
1 
1+ f? 
1. Demonstrate mathematically that the Fourier-series coefficients in the expansion 
of x(t) are given as 


H(f) = 


t= tie (5) 
toeg 27` 
2. Use MATLAB to plot the spectrum {xn} for 0 < |n| < 10. 


3. Since the fundamental frequency of the input signal x(t) is Fo = 1/7) = 1/2, 
the Fourier-series coefficients of the filter output are yn = x,H(n/2). Plot the 
discrete spectrum {yn} of the filter output. 


4. On asingle graph, plot the Fourier-series approximation of the filter output sig- 
nal for n = 1, 3,..., 9. Comment on the results. 
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2.3 Fourier Transform 


Plot the magnitude and the phase spectra of the two signals shown in Figure CP-2.3. 
Notg that the signal x2(t) is a time-shifted version of xı (t). Therefore, we expect the 
two signals to have identical magnitude spectra. 


i. x(t) j. x(t) 





—1 


=. 
Cn 


Figure CP-2.3 Signals x, (¢) and x(t). 


2.4 LTI System Analysis in the Frequency Domain 
The signal 


e“ t>0 
t)= = 
a c t<0 


is passed through a lowpass filter, whose frequency response is specified as `’ 


cos(x f/3) |f| < 1.5 Hz 
0 otherwise ` 


a=] 


= 


. Determine the magnitude and phase spectra of the input signal x(t) and plot 
these on two separate graphs. 


N 


. Determine and plot the-magnitude and phase spectra of the output signal y(t). 


w 


. Use MATLAB to compute the inverse Fourier transform of the output signal and 
plot it. 


2.5. Bandpass to Lowpass Transformation 


The signal 
x(t) = sinc(100r) cos(400xt) 


is a bandpass signal whose frequency content is centered at fo = 200 Hz. 


1. Plot x(t) and the magnitude spectrum |X (f)|. 


2. With fo = 200 Hz, determine the equivalent lowpass signal and plot its magni- 
tude spectrum. Also, plot the in-phase and quadrature components of x(t) and 
the envelope of x (t). 
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2.6 FIR Filter Design 
Use the remez function in MATLAB to design the lowest order (smallest length) 
FIR lowpass filter that satisfies the following specifications: 


Passband: 0 < w < 0.41 
Stopband: 0.51 <w < x 
Passband ripple: 0.5 dB 
Stopband attenuation: 40 dB. 


1. Plot the impulse response coefficients of the FIR filter. 
2. Plot the magnitude and phase of the frequency response of the filter. 


2.7 FIR Hilbert Transform Filter 


Use the remez function to design an FIR Hilbert-transform filter that satisfies the 
following specifications: 


Passband: 0.lx < |w| < 0.5x 
Stopband: 0.62 < |w| < x 
Passband ripple: 5; = 0.01 
Stopband attenuation: ô = 0.01. 
1. Plot the impulse-response coefficients of the FIR filter. 


2. Plot the magnitude of the frequency response of the filter. 
2.8 FIR Differentiator 


Use the remez function to design a 25-tap FIR differentiator that has a passband in 
therange 0.1m < w < 0.6n. 


1. Plot the impulse response of the filter. 
2. Plot the magnitude of the frequency response. 


3. Generate 100 samples of the sinusoid 
ut 
x(n) = Ssin(n), n=0,1,... 


and pass them through the differentiator. Plot the filter output y(n) and compare 
y(n) with x(n). Note that there is a 12-sample delay in the FIR filter. 
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A large number of information sources produce analog signals. Analog signals can be mod- 
ulated and transmitted directly, or they can be converted into digital data and transmitted 
using digital-modulation techniques. The notion of analog-to-digital conversion will be 
examined in detail in Chapter 7. 

Speech, music, images, and video are examples of analog signals. Each of these 
signals is characterized by its bandwidth, dynamic range, and the nature of the signal. For 
instance, in the case of audio and black-and-white video, the signal has just one component, 
which measures air pressure or light intensity. But in the case of color video, the signal 
has four components, namely, the red, green, and blue color components, plus a fourth 
component for the intensity. In addition to the four video signals, an audio signal carries 
the audio information in color-TV broadcasting. Various analog signals exhibit a large 
diversity in terms of signal bandwidth. For instance, speech signals have a bandwidth of 
up to 4 kHz, music signals typically have a bandwidth of 20 kHz, and video signals have a 
much higher bandwidth, about 6 MHz. 

In spite of the general trend toward the digital transmission of analog signals, we 
still have a significant amount of analog signal transmission, especially in audio and video 
broadcast. In Chapters 3 and 4, we treat the transmission of analog signals by carrier mod- 
ulation. (The treatment of the performance of these systems in the presence of noise is 
deferred to Chapter 6.) We consider the transmission of an analog signal by impressing it 
on the amplitude, the phase, or the frequency of a sinusoidal carrier. Methods for demod- 
ulation of the carrier-modulated signal to recover the analog information signal are also 
described. This chapter is devoted to amplitude modulation (AM) systems, where the 
message signals change the amplitude of the carrier. Chapter 4 is devoted to phase- and 
frequency-modulation systems, in which either the phase or the frequency of the carrier is 
changed according to variations in the message signal. 
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3.1 INTRODUCTION TO MODULATION 


The analog signal to be transmitted is denoted by m(t), which is assumed to be a lowpass 
signal of bandwidth W; in other words M(f) = 0, for|f| > W. The power content of this 
signal is denoted by 
P, li 1 fe 2 
m= pm T E |m(t)|* dt. 


The message signal m(t) is transmitted through the communication channel by 
impressing it on a carrier signal of the form 


c(t) = A, cos(27 fet + $c); (3.1.1) 


where A, is the carrier amplitude, f, is the carrier frequency, and ġe is the carrier phase. 
The value of ¢, depends on the choice of the time origin. Without loss of generality, we 
assume that the time origin is chosen such that ġe = 0. We say that the message signal 
`m(t) modulates the carrier signal c(t) in either amplitude, frequency, or phase if after 
modulation, the amplitude, frequency, or phase of the signal becomes a function of the 
message signal. In effect, modulation converts the message signal m(t) from lowpass to 
bandpass, in the neighborhood of the carrier frequency fe. 
Modulation of the carrier c(t) by the message signal m(t) is performed to achieve 
one or more of the following objectives: 


(1) To translate the frequency of the lowpass signal to the passband of the channel so 
that the spectrum of the transmitted bandpass signal will match the passband char- 
acteristics of the channel. For instance, in transmission of speech over microwave 
links in telephony transmission, the transmission frequencies must be increased to 
the gigahertz range for transmission over the channel. This means that modula- 
tion, or a combination of various modulation techniques, must be used to trans- 
late the speech signal from the low-frequency range (up to 4 kHz) to the gigahertz 
range. 


(2 


wa 


To simplify the structure of the transmitter by employing higher frequencies. For 
instance, in the transmission of information using electromagnetic waves, transmis- 
sion of the signal at low frequencies requires huge antennas. Modulation helps trans- 
late the frequency band to higher frequencies, thus requiring smaller antennas. This 
simplifies the structure of the transmitter (and the receiver). 

(3) To accommodate for the simultaneous transmission of signals from several message 

sources, by means of frequency-division multiplexing (FDM). (See Section 3.4.) 

(4) To expand the bandwidth of the transmitted signal in order to increase its noise and 
interference immunity in transmission over a noisy channel, as we will see in our 
discussion of angle modulation in Chapter 6. 


Objectives (1), (2), and (3) are met by all of the modulation methods described in this 
chapter. Objective (4) is met by employing angle modulation to spread the signal m(t) 
over a larger bandwidth, as discussed in Chapter 4. j 
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In the following sections, we consider the transmission and reception of analog sig- 
nals by carrier AM. We will compare these modulation methods on the basis of their 
bandwidth requirements and their implementation complexity. Their performance in the 
presence of additive noise disturbances and their power efficiency will be discussed in 
Chapter 6. 


3.2 AMPLITUDE MODULATION 


In amplitude modulation, the message signal m(t) is impressed on the amplitude of the 
carrier signal c(t) = A, cos(2v fst). This results in a sinusoidal signal whose amplitude 
is a function of the message signal m(t). There are several different ways of amplitude 
modulating the carrier signal by m(t); each results in different spectral characteristics 
for the transmitted signal. We will describe these methods, which are called (a) double- 
sideband suppressed-carrier (DSB-SC) AM, (b) conventional double-sideband AM, (c) 
single-sideband (SSB) AM, and (d) vestigial-sideband (VSB) AM. 


3.2.1 Double-Sideband Suppressed-Carrier AM 


A DSB-SC AM signal is obtained by multiplying the message signal m(t) with the carrier 
signal c(t) = A, cos(2nf,t). Thus, we have the amplitude-modulated signal 


u(t) = m(t)c(t) 
= A,-m(t) cos(2x fet). 


Examples of message signal m(t), carrier signal c(t), and modulated signal u(t) are shown 
in Figure 3.1. This figure shows that a relatively slowly varying message signal m(t) is 
changed into a rapidly varying modulated signal u(t), and due to its rapid changes with 
time, it contains higher-frequency components. At the same time, the modulated signal 
retains the main characteristics of the message signal; therefore, it can be used to retrieve 
the message signal at the receiver. 


Spectrum of the DSB-SC AM Signal. The spectrum of the modulated signal 
can be obtained by taking the Fourier transform of u(t) and using the result of Example 
2.3.14. Thus, we obtain 


Ac 
U(f)= z IMG — f+ ME + fo). 


Figure 3.2 illustrates the magnitude and phase spectra, respectively, for M(f) and U(f). 

The magnitude of the spectrum of the message signal m(t) has been translated 
or shifted in frequency by an amount f,. Furthermore, the bandwidth occupancy of the 
amplitude-modulated signal is 2W, whereas the bandwidth of the message signal m(t) is 
W. Therefore, the channel bandwidth required to transmit the modulated signal u(t) is 
B. = 2W. 
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m(t) 


c(t) = A cos(2nf t) 


en 
TATA 


u(t) = m(A)c(t) 


AAA gl | 
STAT ee TTT 


Figure 3.1 Examples of message, carrier, and DSB-SC-modulated signals. 


The frequency content of the modulated signal u(t) in the frequency band |f| > fe 
is called the upper sideband of U(f), and the frequency content in the frequency band 
IfI < fc is called the lower sideband of U(f). It is important to note that either one of 
the sidebands of U (f ) contains all the frequencies that are in M(f). That is, the frequency 
content of U(f) for f > fe corresponds to the frequency content of M(f) for f > 0, and 
the frequency content of U (f) for f < — fe corresponds to the frequency content of M(f) 
for f < 0. Hence, the upper sideband of U(f) contains all the frequencies in M(f). A 
similar statement applies to the lower sideband of U (f ). Therefore, the lower sideband of 
U(f) contains all the frequency content of the message signal M (f). Since U (f) contains 
both the upper and the lower sidebands, it is called a double-sideband (DSB) AM signal. 

The other characteristic of the modulated signal u(t) is that it does not contain a car- 
rier component. That is, all the transmitted power is contained in the modulating (message) 
signal m(t). This is evident from observing the spectrum of U (f). As long as m(t) does 
not have any DC component, there is no impulse in U (f) at f = fe; this would be the case 
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IM(f)| ZM(f) 
A 





f.-W-f.f+W 0 f-W +w 
LU(f) 
-f. i Figure 3.2 Magnitude and phase 
spectra of the message signal m(t) 
0 f- f and the DSB AM-modulated signal 
u(t). 


if a carrier component was contained in the modulated signal u(t). For this reason, u(t) is 
called a suppressed-carrier signal. Therefore, u(t) is a DSB-SC AM signal. 


Example 3.2.1 
Suppose that the modulating signal m(t) is a sinusoid of the form 


m(t) = a cos 27 fmt Tin K fe. 
Determine the DSB-SC AM signal and its upper and lower sidebands. 
Solution The DSB-SC AM is expressed in the time domain as 


u(t) = m(t)c(t) = Aca cos(27 fmt) cos(2n fet) 





Ac cos[2n( fe + fm)t]. 


Ac 
= = cos[2n( fe = Sint] + 


Taking the Fourier transform, the modulated signal in the frequency domain will have the 
following form: 


A,a 
4 LECS — fe + fm) HEC + fe - fm)) 


Ac 
+ 2 BQ - fo — fa) +E + fot Fadl 


U(f)= 





This spectrum is shown in Figure 3.3(a). The lower sideband of u (t) is the signal 


Aca 


7 cos[2n( fe — fm)t], 





ue(t) = 


122 Amplitude Modulation Chapter 3 


U(f) 








U(f) 
Aca 
ep 
ers 0 fh f 
(b) 
UP) 





-fe - fin 0 fe + Ín f 
(c) 


Figure 3.3 The (magnitude) spectrum of a DSB-SC AM signal for (a) a sinusoidal message signal and (b) its 
lower and (c) upper sidebands. 


and its spectrum is illustrated in Figure 3.3(b). Finally, the upper sideband of u(t) is the signal 


u,(t) = a cos[2n( fe + fm)#], 


and its spectrum is illustrated in Figure 3.3(c). a 


Example 3.2.2 


Let the message signal be m(t) = sinc(10*t). Determine the DSB-SC-modulated signal and 
its bandwidth when the carrier is a sinusoid with a frequency of 1 MHz. 


Solution In this example, c(t) = cos(2m x 10°t). Therefore, u(t) = sinc(10*t) cos(2n x 
106+). A plot of u(t) is shown in Figure 3.4. To obtain the bandwidth of the modulated signal, 
we first need to have the bandwidth of the message signal. Wehave M (f) = #sinc(10*t) = 
10-*11(10~* f). The Fourier transform is constant in the frequency range from —5000 Hz to 
5000 Hz, and it is zero at other frequencies. Therefore, the bandwidth of the message signal 
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-4 -2 0 2 pi Figure 3.4 Plot of 
x 10 u(t) = sinc(104t) cos(2m x 10°F). 


is W = 5000 Hz, and the bandwidth of the modulated signal is twice the bandwidth of the 
message signal, i.e., 10,000 Hz or 10 kHz. r 


Power Content of DSB-SC Signals. In order to compute the power content of 
the DSB-SC signal, we employ the definition of the power content of a signal given in 
Equation (2.1.11). Thus, 


T/2 


1 te 
= lim f A?m? (t) cos? (27 fet) dt 
-T/2, 
A2 1 T/2 p : 
= — lim z) m?(t)[1 + cos(47x f-t)] dt (3.2.1) 
2 


ee pe, (3.2.2) 


where Pm indicates the power in the message signal m(t). The last step follows from the 
fact that m?(t) is a slowly varying signal and when multiplied by cos(41f,t), which is 
a high-frequency sinusoid, the result is a high-frequency sinusoid with a slowly varying 
envelope, as shown in Figure 3.5. 

Since the envelope is slowly varying, the positive and the negative halves of each 
cycle have almost the same amplitude. Hence, when they are integrated, they cancel each 
other. Thus, the overall integral of m?(t) cos(4x f,t) is almost zero. This is depicted in 
Figure 3.6. Since the result of the integral is divided by T, and T becomes very large, the 
second term in Equation (3.2.1) is zero. 
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x(t) 


MDa 
MNN 


Figure 3.5 Plot of m? (t) cos(4nf,t). 


x(t) 


01d | 
WAN 


6 This figure shows why the second termin Equation (3.2.1) is zero. 





Figure 3. 


Example 3.2.3 
In Example 3.2.1, determine the power in 


sidebands. 


the modulated signal and the power in each of the 


Solution The message signal is m(t) = acos2n fnt. Its power was obtained in Example 


2.1.10 and Equation (2.1.12) as 


a 
Pm = 5? 
Therefore, 
2 2,2 
P,= Ac Pn = Ag 
EI 4 
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Because of the symmetry of the sidebands, the powers in the upper and lower sidebands, Ps 
and Ps, are equal and given by 





Demodulation of DSB-SC AM Signals. Suppose that the DSB-SC AM signal 
u(t) is transmitted through an ideal channel (with no channel distortion and no noise). Then 
the received signal is equal to the modulated signal, i.e., 


r(t) = u(t) 
= A.m(t) cos(27 f-t). (3.2.3) 


Suppose we demodulate the received signal by first multiplying r(t) by a locally 
generated sinusoid cos(2x fst + @), where ¢ is the phase of the sinusoid. Then, we pass the 
product signal through an ideal lowpass filter with the bandwidth W. The multiplication of 
r (t) with cos(2x fet + ġ) yields 


r(t)cos(2nf,t + $) = Aem(t) cos(2nf,t) cos(27 fet + Q) 
= SAn) cos(~) + Amo) cos(4n fet + ¢ġ). 


The spectrum of the signal is illustrated in Figure 3.7. Since the frequency content 
of the message signal m(t) is limited to W Hz, where W < fe, the lowpass filter can 
be designed to eliminate the signal components centered at frequency 2, and to pass 
the signal components centered at frequency f = 0 without experiencing distortion. An 
ideal lowpass filter that accomplishes this objective is also illustrated in Figure 3.7. Conse- 
quently, the output of the ideal lowpass filter is 


1 
yelt) = 5 Acm(t) cosl). (3.2.4) 


Note that m(t) is multiplied by ‘cos(); therefore, the power in the demodulated 
signal is decreased by a factor of cos? #. Thus, the desired signal is scaled in amplitude by 
a factor that depends on the phase ¢ of the locally generated sinusoid. When ¢ # 0, the 
amplitude of the desired signal is reduced by the factor cos(@). If p = 45°, the amplitude 
of the desired signal is reduced by /2 and the signal power is reduced by a factor of 2. If 
$ = 90°, the desired signal component vanishes. 


_Ideal lowpass filter 





Figure 3.7 Frequency-domain representation of 
the DSB-SC AM demodulation. 


126 Amplitude Modulation Chapter 3 


Transmitted 
mA modulated 
signal 
A,/A, 
A, cos2uf,t 
a i Figure 3.8 Addition of a pilot tone 
‘So Oscillator to a DSB-AM signal. 
Received 
signal 


r(t) 


Lowpass m(t) 
filter 


Narrowband 
filter tuned 


tofe demodulate a DSB-AM signal. 


Figure 3.9 Use of a pilot tone to 





The preceding discussion demonstrates the need for a phase-coherent or synchronous 
demodulator for recovering the message signal m(t) from the received signal. That is, the 
phase ¢ of the locally generated sinusoid should ideally be equal to 0 (the phase of the 
received-carrier signal). 

A sinusoid that is phase locked to the phase of the received carrier can be generated 
at the receiver in one of two ways. One method is to add a carrier component into the trans- 
mitted signal, as illustrated in Figure 3.8. We call such a carrier component “a pilot tone.” 
Its amplitude A, and its power A? /2 are selected to be significantly smaller than those of 
the modulated signal u(t). Thus, the transmitted signal is a double sideband, but it is no 
longer a suppressed carrier signal. At the receiver, a narrowband filter tuned to frequency 
fe filters out the pilot signal component; its output is used to multiply the received signal, 
as shown in Figure 3.9. We may show that the presence of the pilot signal results in a DC 
component in the demodulated signal; this must be subtracted out in order to recover m(t). 

Adding a pilot tone to the transmitted signal has a disadvantage: it requires that a 
certain portion of the transmitted signal power must be allocated to the transmission of 
the pilot. As an alternative, we may generate a phase-locked sinusoidal carrier from the 
received signal r (t) without the need of a pilot signal. This can be accomplished by the use 
of a phase-locked loop, as described in Section 8.8.1. 


3.2.2 Conventional Amplitude Modulation 


A conventional AM signal consists of a large carrier component, in addition to the DSB 
AM-modulated signal. The transmitted signal is expressed mathematically as 


u(t) = A-[1 + m(t)] cos(2n fet), (3.2.5) 
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Envelope 


ser A() 





Figure 3.10 A conventional AM signal in the time domain. 


where the message waveform is constrained to satisfy the condition that |m(t)| < 1. We 
observe that A-m(t) cos(21 fct) is a DSB-AM signal and A, cos(21 fet) is the carrier com- 
ponent. Figure 3.10 illustrates an AM signal in the time domain. As we will see later in 
this chapter, the existence of this extra carrier results in a very simple structure for the 
demodulator. That is why commercial AM broadcasting generally employs this type of 
modulation. 

As long as |m(t)| < 1, the amplitude A,[1 + m(t)] is always positive. This is the 
desired condition for conventional DSB AM that makes it easy to demodulate, as we will 
describe. On the other hand, if m(t) < —1 for some t, the AM signal is overmodulated and 
its demodulation is rendered more complex. In practice, m (t) is scaled so that its magnitude 
is always less than unity. 

It is sometimes convenient to express m (t) as 


m(t) = amz,(t), 


where m,(t) is normalized such that its minimum value is —1. This can be done, e.g., by 


defining 
m(t) 


TS max |m(t)|` 


In this case, the scale factor a is called the modulation index, which is generally a constant 
less than 1. Since |m, (t)| < 1and 0 < a < 1, we have 1 +am, (t) > 0, and the modulated 
signal can be expressed as 


u(t) = Ac [1 + am, (t)] cos 27 fet, (3.2.6) 


which will never be overmodulated. 


Spectrum of the Conventional AM Signal. If m(t) is a message signal with 
Fourier transform (spectrum) M (f), the spectrum of the amplitude-modulated signal 
u(t) is : 


U(f) = FlAcam (t) cos(2n fet)] + F[Ac cos(2n fct)] 


Ae Ac 
= < Malf = fe) + Mal f + fol + FCF — fe) + 8(F + fo. 
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Figure 3.11 Conventional AM in both the time and frequency domain. 


A message signal m(t), its spectrum M (f), the corresponding modulated signal u(t), and 
its spectrum U (f) are shown in Figure 3.11. Obviously, the spectrum of a conventional 
AM signal occupies a bandwidth twice the bandwidth of the message signal. 


d 
Example 3.2.4 


Suppose that the modulating signal m(t) is a sinusoid of the form 
m(t) =cos2mfint fm K fo 


Determine the DSB-AM signal, its upper and lower sidebands, and its spectrum, assuming a 
modulation index of a. 


Solution From Equation (3.2.6), the DSB-AM signal is expressed as 


u(t) = A-[1 + acos2u ft] cos(2n fet) 


= A,cos(2 fe 





= Sm)t] 


+ =< cos[2m( fe + fm)t]. 


The lower-sideband component is 


Ac 
ult) =  cosl2n(f.— fn, 
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while the upper-sideband component is 


Aca 


u(t) = -> cos[2r(fe + fm)t]. 





The spectrum of the DSB-AM signal u(t) is 
Ac 
U(f) = T [6(f — fo +6(f + f 
A,a 
+ 7. [6(f d fe + tm) +6(f + te = Sin)] 


+ = SCF = fe — fa) +SS + fet fu 


The magnitude spectrum |U (f)| is shown in Figure 3.12. It is interesting to note that 
the power of the carrier component, which is A2/2, exceeds the total power (A2a? /4) of the 
two sidebands because a < 1. a 


Power for the Conventional AM Signal. A conventional AM signal is similar 
to a DSB when m (t) is substituted with 1 +m, (t). As we have already seen in the DSB-SC 
case, the power in the modulated signal is [see Equation (3.2.2)] 


Az 


Py = Pm» 


where Pm denotes the power in the message signal. For the conventional AM, 


k 1 rrr 
P, = lim = J [l + am, (t)? dt 


7 1 T/2 zed 
= lim = / n [1 + a’m3(t)] dt, 


where we have assumed that the average of m,(t) is zero. This is a valid assumption for 
many signals, including audio signals. Therefore, for conventional AM, 


Pn = 1 +a’ Pm; 


hence, 





-fe — fm fe Sfo + fn 0 fe fm fe fet fm 


Figure 3.12 Spectrum of a DSB-AM signal in Example 3.2.4. 
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The first component in the preceding relation applies to the existence of the carrier, and 
this component does not carry any information. The second component is the information- 
carrying component. Note that the second component is usually much smaller than the first 
component (a < 1, |m,(t)| < 1, and for signals with a large dynamic range, Pm, & 1). 
This shows thatthe conventional AM systems are far less power efficient than the DSB-SC 
systems. The advantage of conventional AM is that it is easily demodulated. 


Example 3.2.5 


The signal m(t) = 3 cos(200xt) + sin(600z?) is used to modulate the carrier c(t) = cos(2 x 
10°t). The modulation index is a = 0.85. Determine the power in the carrier component and 
in the sideband components of the modulated signal. ` 


Solution The message signal is shown in Figure 3.13. First, we determine m,(t), the nor- 
malized message signal. In order to find m,(t), we have to determine max |m(t )|. To determine 
the extrema of m (t), we find its derivative and make it equal to zero. We then have 


m’ (t) = —600zx sin(200nt) + 600m cos(600z1t) 
= 0, 
which results in 


cos(600nt) = sin(200rt) 


= Cos (> _ 200r) . 


One solution of this equation is 800xt = at ort = Ta- Substituting this value into m(t), we 


obtain : 
m (=) = 3.6955, 


which is the maximum value of the signal m(t). Therefore, 
3cos(200nt) + sin(600m?r) 

3.6955 
= 0.8118 cos(200x¢) + 0.2706 sin(600nz). 


m,(t) F3 


N W 
CS 


—4 : Figure 3.13 The message signal in 
0 0.005 0.01 0.015 0.02 0.025 0.03 0.035 0.04 Example 3.2.5. 
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The power in the sum of two sinusoids with different frequencies is the sum of powers 
in them. Therefore, 


Pa = 


The power in the carrier component of the modulated signal is 
2 
= 


=0.5, 
2 


and the power in the sidebands is 


2 
fea? Pr = 3 x 0.85 x 0.3661 


= 0.1323. B 


Demodulation of Conventional DSB-AM Signals. The major advantage of 
conventional AM signal transmission is the ease in which the signal can be demodulated. 
There is no need for a synchronous demodulator. Since the message signal m(t) satisfies 
the condition |m(t)| < 1, the envelope (amplitude) 1 -+m (t) > 0. If we rectify the received 
signal, we eliminate the negative values without affecting the message signal, as shown in 
Figure 3.14. 

The rectified signal is equal to u(t) when u(t) > 0, and it is equal to zero when 
u(t) < 0. The message signal is recovered by passing the rectified signal through a lowpass 
filter whose bandwidth matches that of the message signal. The combination of the rectifier 
and the lowpass filter is called an envelope detector. 

Ideally, the output of the envelope detector is of the form 


d(t) = 81 + g2m(t), (3.2.7) 


where g; represents a DC component and g2 is a gain factor due to the signal demodulator. 
The DC component can be eliminated by passing d(t) through a transformer, whose output 
is gom(t). 





Figure 3.14 Envelope detection of a conventional AM signal. 


132 Amplitude Modulation Chapter 3 


The simplicity of the demodulator has made conventional DSB AM a practical choice 
for AM radio broadcasting. Since there are literally billions of radio receivers, an inexpen- 
sive implementation of the demodulator is extremely important. The power inefficiency of 
conventional AM is justified by the fact that there are few broadcast transmitters relative 
to the number of receivers. Consequently, it is cost-effective to construct powerful trans- 
mitters and sacrifice power efficiency in order to simplify the signal demodulation at the 
receivers. 


3.2.3 Single-Sideband AM 


In Section 3.2.1, we showed that a DSB-SC AM signal required a channel bandwidth of 
B. = 2W Hz for transmission, where W is the bandwidth of the message signal. However, 
the two sidebands are redundant. We will demonstrate that the transmission of either side- 
band is sufficient to reconstruct the message signal m(t) at the receiver. Thus, we reduce 
the bandwidth of the transmitted signal to that of the baseband message signal m(t). 

In the appendix at the end of this chapter, we will demonstrate that a single-sideband 
(SSB) AM signal is represented mathematically as 


u(t) = Apm(t) cos 27 fet F A,m(t) sin2n fet, (3.2.8) 


where m(t) is the Hilbert transform of m(t) that was introduced in Section 2.6, and the 
plus or minus sign determines which sideband we obtain. The plus sign indicates the lower 
sideband, and the minus sign indicates the upper sideband. Recall that the Hilbert transform 
may be viewed as a linear filter with impulse response h(t) = 1/mt and frequency response 


—j, f >0 
A(f)= Lf <0. (3.2.9) 
0, f =0 


Therefore, the SSB-AM signal u(t) may be generated by using the system configuration 
shown in Figure 3.15. 


m(t) 






Hilbert 
transform 








m(t) 


Figure 3.15 Generation of a lower SSB-AM 
signal. 
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mQ) Bandpass u(i) 
filter 


Ac cos 2xfet 


© Figure 3.16 Generation of an SSB-AM signal by 
filtering one of the sidebands of a DSB-SC AM signal. 


The method shown in Figure 3.15 employs a Hilbert-transform filter. Another 
method, illustrated in Figure 3.16, generates a DSB-SC AM signal and then employs a 
filter that selects either the upper sideband or the lower sideband of the DSB-AM signal. 


Example 3.2.6 
Suppose that the modulating signal is a sinusoid of the form 


m(t) =COS2n fmt, fm K fe- 
Determine the two possible SSB-AM signals. 
Solution The Hilbert transform of m (t) is 
m(t) = sin 2n fnt. (3.2.10) 


Hence, 


u(t) = A, COS 27 fmt cos 2x fet F Ae Sin 2N fmt sin 2r fet. (3.2.11) 
If we take the upper (—) sign, we obtain the upper-sideband signal 
Uu (t) = Ac cos 2n( fo + fmt. 


On the other hand, if we take the lower (+) sign in Equation (3.2.11), we obtain the lower- 
sideband signal 


ue(t) = Ac cos 27 (fe — Sant. 
The spectra of u,,(t) and ue(t) were previously given in Figure 3.3. E 
Demodulation of SSB-AM Signals. To recover the message signal m(t) in the 
received SSB-AM signal, we require a phase-coherent or synchronous demodulator, as was 


the case for DSB-SC AM signals. Thus, for the upper single-sideband (USSB) signal given 
in Equation (3A.7), we have 


r(t) cos(27 fst + $) = u(t) cos(2x fet + $) ` 
= FAm) cos ọ + SAA) sing 


+ double-frequency terms. (3.2.12) 
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By passing the product signal in Equation (3.2.12) through an ideal lowpass filter, the 
double-frequency components are eliminated, leaving us with š 


ye(t) = LAm coso + TARO sing. (3.2.13) 


Note that the phase offset not only reduces the amplitude of the desired signal m(t) 
by cos @¢, but it also results in an undesirable sideband signal due to the presence of M(t) 
in y(t). The latter component was not present in the demodulation of a DSB-SC signal. 
However, it is a factor that contributes to the distortion of the demodulated SSB signal. 

The transmission of a pilot tone at the carrier frequency is a very effective method 
for providing a phase-coherent reference signal for performing synchronous demodulation 
at the receiver. Thus, the undesirable sideband-signal component is eliminated. However, 
this means that a portion of the transmitted power must be allocated to the transmission of 
the carrier. 

The spectral efficiency of SSB AM makes this modulation method very attractive 
` for use in voice communications over telephone channels (wirelines and cables). In this 
application, a pilot tone is transmitted for synchronous demodulation and shared among 
several channels. 

The filter method shown in Figure 3.16, whichselects one of the two signal sidebands 
for transmission, is particularly difficult to implement when the message signal m(t) has a 
large power concentrated in the vicinity of f = 0. In such a case, the sideband filter must 
have an extremely sharp cutoff in the vicinity of the carrier in order to reject the second 
sideband. Such filter characteristics are very difficult to implement in practice. 


3.2.4 Vestigial-Sideband AM 


The stringent frequency-response requirements on the sideband filter in an SSB-AM sys- 
tem can be relaxed by allowing vestige, which is a portion of the unwanted sideband, to 
appear at the output of the modulator. Thus, we simplify the design of the sideband filter 
at the cost of a modest increase in the channel bandwidth required to transmit the signal. 
The resulting signal is called vestigial-sideband (VSB) AM. This type of modulation is 
appropriate for signals that have a strong low-frequency component, such as video signals. 
That is why this type of modulation is used in standard TV broadcasting. 

To generate a VSB-AM signal, we begin by generating a DSB-SC AM signal and 
passing it through a sideband filter with the frequency response H(f), as shown in 
Figure 3.17. In the time domain, the VSB signal may be expressed as 


u(t) = [A,m(t) cos2n fet] x h(t), (3.2.14) 


where A(t) is the impulse response of the VSB filter. In the frequency domain, the corre- 
sponding expression is 


Ae 
Ufp= zMU — f) + M(f + AHF). (3.2.15) 
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m(t) * Sideband u(t) 
filter VSB-AM signal 


A(f) 






A,cos 21f,t 


Figure 3.17 Generation of 
VSB-AM signal. 


u(t) Ideal Am(t) 
— lowpass [> 
filter 
Ils w 
cos 2nf,t 


Figure 3.18 Demodulation of VSB signal. 


To determine the frequency-response characteristics of the filter, we will consider the 
demodulation of the VSB signal u(t). We multiply u(t) by the carrier component cos 21 f,t 
and pass the result through an ideal lowpass filter, as shown in Figure 3.18. Thus, the 
product signal is 


v(t) = u(t) cos 27 fet, 
or equivalently, 


1 
VA= zU O — fe) + UF + fo)l. (3.2.16) 


If we substitute U (f) from Equation (3.2.15) into Equation (3.2.16), we obtain 
Ac 
v(f)= zMU = 2f) + M(f)A(F — fo) 


Ac 
+ gMO) HMF +2FAH(S + fo). (3.2.17) 


The lowpass filter rejects the double-frequency terms and passes only the components in 
the frequency range |f| < W. Hence, the signal spectrum at the output of the ideal lowpass 
filter is 


Ac 
Vf) = g YOHU — fo) + HF + fol. (3.2.18) 


The message signal at the output of the lowpass filter must be undistorted. Hence, 
the VSB filter characteristic must satisfy the condition 


A(f — fe) + H(f + fe) = constant |f| < W. (3.2.19) 


This condition is satisfied by a filter that has the frequency-response characteristics shown 
in Figure 3.19. 
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HP) 
-wW TN 0 fh tht AW F 
of f 


H(f— fe) + Hf + fo) 


H(f- fe 
N es 


~2fc “Wha 8 fa W 2fe f 


Figure 3.19. VSB filter characteristics. 


H(f) 
fem fa ; EFW 0 fez W fit fa f 
fe fe 


Figure 3.20 Frequency response of the VSB filter for selecting the lower sideband of the message signals. 


Wenote that H (f) selects the upper sideband and a vestige of the lower sideband. It 
has odd symmetry about the carrier frequency fs in the frequency range fe — fa < f < 
tc + fa, where fa is a conveniently selected frequency that is some small fraction of W, 
i.e., fa K W. Thus, we obtain an undistorted version of the transmitted signal. Figure 3.20 
illustrates the frequency response of a VSB filter that selects the lower sideband and a 
vestige of the upper sideband. 

In practice, the VSB filter is designed to have some specified phase characteristics. 
To avoid distortion of the message signal, the VSB filter should have a linear phase over 
its passband fe — fa < |f| < fo + W. 


Example 3.2.7 
Suppose that the message signal is given as 


m(t) = 10 + 4cos 2nt + 8cos4nt + 10 cos 20nt. 


Specify both the frequency-response characteristics of a VSB filter that passes the upper side- 
band and the first frequency component of the lower sideband. 
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-f,— 10 -f 0 f2+f, 10+f f 


Figure 3.21 Frequency-response characteristics of the VSB filter in Example 3.2.7. 


Solution The spectrum of the DSB-SC AM signal u(t) = m(t) cos 27 fet is 


U(f) = SCF — fe) +8 + SAHU — f-D+6(F + f+1)) 
+418(f — fe ~D) ECS + foe +2] +S — fe — 10) + 8(f + fe + 10)]. 


The VSB filter can be designed to have unity gain in the range 2 < |f — f,| < 10, a gain of 
1/2at f = fe, a gain of 1/2+« at f = fe + 1, and a gain of 1/2 — «œ at f = f, — 1, where œ 
is some conveniently selected parameter that satisfies the condition 0 < œ < 1/2. Figure 3.21 
illustrates the frequency-response characteristic of the VSB filter. a 


3.3 IMPLEMENTATION OF AMPLITUDE MODULATORS AND DEMODULATORS 


There are several different methods for generating AM-modulated signals. In this section, 
we shall describe the methods most commonly used in practice. Since the process of mod- 
ulation involves the generation of new frequency components, modulators are generally 
characterized as nonlinear and/or time-variant systems. 


Power-Law Modulation. Let us consider the use of a nonlinear device such as a 
P-N diode, which has voltage-current characteristic shown in Figure 3.22. 

Suppose that the voltage input to such a device is the sum of the message signal m (t) 
andthe carrier A, cos 2x fet, as illustratéd in Figure 3.23. The nonlinearity will generate a 
product of the message m (t) with the carrier, plus additional terms. The desired modulated 
signal can be filtered out by passing the output of the nonlinear device through a bandpass 
filter. 

To elaborate, suppose that the nonlinear device has an input-output (square-law) 
characteristic of the form 

volt) = avi (t) + ax? (t), (3.3.1) 


0 D Figure 3.22 Voltage--current characteristic of P—N diode. 





138 Amplitude Modulation Chapter 3 


m(t) Nonlinear 


Bandpass u(t) 
. —— 
device 


— filter 
tuned to fe 






Acos 2xfet 


Figure 3.23 Block diagram of 
power-law AM modulator. 


where v; (t) is the input signal, vo(t) is the output signal, and the parameters (a1, a2) are 
constants. Then, if the input to the nonlinear device is 


vu; (t) = m(t) + Ac cos 2n fet, (3.3.2) 
its output is 
v(t) = ai[m(t) + Ac cos 27 fet] 


+ anm(t) + A,cos2n fot? 


= am (t) + aam? (t) + aA? cos? 27 fot 
+ Acar 1 + e me] cos 27 fet. (3.3.3) 
The output of the bandpass filter with a bandwidth 2W centered at f = fe yields 
u(t) = Aca: + e m] cos 21 fst, (3.3.4) 


where 2a2|m(t)|/a1 < 1 by design. Thus, the signal generated by this method is a conven- 
tional AM signal. 


Switching Modulator. Another method for generating an AM-modulated signal 
is by means of a switching modulator. Such a modulator can be implemented by the system 
illustrated in Figure 3.24(a). The sum of the message signal and the carrier, which is given 
by Equation (3.3.2), are applied to a diode that has the input—output voltage characteristic 
shown in Figure 3.24(b), where A, >> m(t). The output across the load resistor is simply 


_ Juict), c(t)>0 
v(t) = k seed" (3.3.5) 


This switching operation may be viewed mathematically as a multiplication of the 
input v; (t) with the switching function s(t), i.e., 


volt) = [m(t) + A-cos2n fetls(t), (3.3.6) 


where s(t) is shown in Figure 3.24(c). 
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Figure 3.24 Switching modulator and periodic switching signal. 


Since s(t) is a periodic function, it is represented in the Fourier series as 


s(t) = l 





2 oo (~1)"7! 
ae y zy Lg STS n — D). 


This is similar to Equation (2.2.11). 
Hence, 


volt) = [m(t) + Ae cos 2 fet]s(t) 


Ac 
= — |1 
[+ 





4 
mo cos 21 fet + other terms. 
TA, 
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(3.3.7) 


(3.3.8) 


The desired AM-modulated signal is obtained by passing vo(t) through a bandpass filter 
with the center frequency f = f, and the bandwidth 2W. At its output, we have the desired 


conventional AM signal 
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Ac 4 
u(t) = > I + mio cos 21 fet. (3.3.9) 


Balanced Modulator. A relatively simple method for generating a DSB-SC AM 
signal is to use two conventional AM modulators arranged in the configuration illustrated 
in Figure 3.25. 

For example, we may use two square-law AM modulators as previously described. 
Care must be taken to select modulators with approximately identical characteristics so 
that the carrier component cancels out at the summing junction. 


Ring Modulator. Another type of modulator for generating a DSB-SC AM sig- 
nal is the ring modulator illustrated in Figure 3.26. 

The switching of the diodes is controlled by a square wave of frequency fe, denoted 
as c(t), which is applied to the center taps of the two transformers. When c(t) > 0, the 
top and bottom diodes conduct, while the two diodes in the crossarms are off. In this case, 
the message signal m(t) is multiplied by +1. When c(t) < 0, the diodes in the crossarms 
of the ring conduct, while the other two diodes are switched off. In this case, the message 


m(t) AM 
modulator 


| Ac cos 2nf,t 


A [1 + m(d)] cos 2nf,t 






u(t) = 2A,m(t) cos 2nfct 


-mQ| am | : 
—— —————— lt ; : 
modulator = Figure 3.25 Block diagram 
ae Acl — m(¢)] cos 2nfet of a balanced modulator. 
a 5 
m(t) | | 


Square-wave carrier 
at f= fe 


Figure 3.26 Ring modulator for generating a DSB-SC AM signal. 
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signal m(t) is multiplied by —1. Consequently, the operation of the ring modulator may be 
described mathematically as a multiplier of m (t) by the square-wave carrier c(t), i.e., 


vo(t) = m(t)c(t), (3.3.10) 


as shown in Figure 3.26. 
Since c(t) is a periodic function, it is represented by the Fourier series 


yet 


c(t) = a D œ cos[2n f.(2n ~ 1)t]. (3.3.11) 


Again, this equation is similar to Equation (2.2.11). Hence, the desired DSB-SC AM 
signal u(t) is obtained by passing vo(t) through a bandpass filter with the center frequency 
fc and the bandwidth 2W. 

From the preceding discussion, we observe that the balanced modulator and the ring 
modulator systems, in effect, multiply the message signal m(t) with the carrier to pro- 
duce a DSB-SC AM signal. The multiplication of m(t) with Ac cos wet is called a mixing 
operation. Hence, a mixer is basically a balanced modulator. 

The method shown in Figure 3.15 for generating an SSB signal requires two mixers, 
i.e., two balanced modulators, in addition to the Hilbert transformer. On the other hand, 
the filter method illustrated in Figure 3.16 for generating an SSB signal requires a single 
balanced modulator and a sideband filter. 

Let us now consider the demodulation of AM signals. We begin with a description 
of the envelope detector. 


Envelope Detector. As previously indicated, conventional DSB-AM signals are 
easily demodulated by an envelope detector. A circuit diagram for an envelope detector is 
shown in Figure 3.27. It consists of a diode and an RC circuit, which is basically a simple 
lowpass filter. 

During the positive half-cycle of the input signal, the diode conducts and the capaci- 
tor charges up to the peak value of the input signal. When the input falls below the voltage 
on the capacitor, the diode becomes reverse-biased and the input disconnects from the out- 
put. During this period, the capacitor discharges slowly through the load resistor R. On 
the next cycle of the carrier, the diode again conducts when the input signal exceeds the 
voltage across the capacitor. The capacitor again charges up to the peak value of the input 
signal and the process is repeated. 


C 


r(t) 4 R m(t) 
a oe Zz Figure 3.27 An envelope detector. 
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The time constant RC must be selected to follow the variations in the envelope of the 
carrier-modulated signal. If RC is too small, then the output of the filter falls very rapidly 
after each peak and will not follow the envelope of the modulated signal closely. This 
corresponds to the case where the bandwidth of the lowpass filter is too large. If RC is too 
large, then the discharge of the capacitor is too slow and again the output will not follow 
the envelope of the modulated signal. This corresponds to the case where the bandwidth 
of the lowpass filter is too small. The effect of large and small RC values is shown in 
Figure 3.28. 

In effect, for good performance of the envelope detector, we should have 


1 1 
> KRC K. 
fe Ww 


Figure 3.28 Effect of (a) large and 
(b) small RC values on the 
performance of the envelope detector. 
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In such a case, the capacitor discharges slowly through the resistor; thus, the output of the 
envelope detector, which we denote as m(t), closely follows the message signal. 


Example 3.3.1 
Anaudio signal of bandwidth W = 5 kHz is modulated on a carrier of frequency 1 MHz using 
conventional AM. Determine the range of values of RC for successful demodulation of this 
signal using an envelope detector. 


Solution We must have a K RC & a; therefore, 1076 «K RC < 2 x 107“. In this case, 
RC = 1073 is an appropriate choice. a 


Demodulation of DSB-SC AM Signals. As previously indicated, the demodu- 
lation of a DSB-SC AM signal requires a synchronous demodulator. That is, the demod- 
ulator must use a coherent phase reference, which is usually generated by means of a 
phase-locked loop (PLL) (see Section 8.8.1), to demodulate the received signal. 

The general configuration is shown in Figure 3.29. A PLL is used to generate a 
phase-coherent carrier signal that is mixed with the received signal in a balanced modula- 
tor. The output of the balanced modulator is passed through a lowpass filter of bandwidth 
W that passes the desired signal and rejects all signal and noise components above W Hz. 
The characteristics and operation of the PLL are described in Section 8.8.1. 


Demodulation of SSB Signals. The demodulation of SSB-AM signals also 
requires the use of a phase-coherent reference. In the case of signals, such as speech, that 
have relatively little or no power content at DC, it is simple to generate the SSB signal, 
as shown in Figure 3.16. Then, we can insert a small carrier component that is transmitted 
along with the message. In such a case, we may use the configuration shown in Figure 3.30 
to demodulate the SSB signal. We observe that a balanced modulator is used to convert the 
frequency of the bandpass signal to lowpass or baseband. 


Demodulation of VSB Signals. In VSB, acarrier component is generally trans- 
mitted along with the message sidebands, The existence of the carrier component makes it 
possible to extract a phase-coherent reference for demodulation in a balanced modulator, 
as shown in Figure 3.30. 

In applications such as a TV broadcast, a large carrier component is transmitted along 
with the message in the VSB signal. In such a case, it is possible to recover the message 
by passing the received VSB signal through an envelope detector. 


m(t) 


Balanced Lowpass 
modulator filter 


Phase-locked 
loop Figure 3.29 Demodulator for 
a DSB-SC signal. 
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Figure 3.30 Demodulation of 
SSB-AM signal containing a 
carrier component. 


3.4 SIGNAL MULTIPLEXING 


When we use a message signal m(t) to modulate the amplitude of a sinusoidal carrier, we 
translate the message signal by an amount equal to the carrier frequency f,. If we have 
two or more message signals to transmit simultaneously over the communication channel, 
we can have each message signal modulate a carrier of a different frequency, where the 
minimum separation between two adjacent carriers is either 2W (for DSB AM) or W (for 
SSB AM), where W is the bandwidth of each of the message signals. Thus, the various 
message signals occupy separate frequency bands of the channel and do not interfere with 
one another during wansmission. 

Combining separate message signals into a composite signal for transmission over 
a common channel is called multiplexing. There are two commonly used methods for sig- 
nal multiplexing: (1) time-division multiplexing and (2) frequency-division multiplexing 
(FDM). Time-division multiplexing is usually used to wansmit digital information; this 
will be described in Section 7.6. FDM may be used with either analog or digital signal 
transmission. 


3.4.1 Frequency-Division Multiplexing 


In FDM, the message signals are separated in frequency, as previously described. A typical 
configuration of an FDM system is shown in Figure 3.31. This figure illustrates the FDM 
of K message signals at the transmitter and their demodulation at the receiver. The lowpass 
filters (LPFs) at the wansmitter ensure that the bandwidth of the message signals is limited 
to W Hz. Each signal modulates a separate carrier; hence, K modulators are required. 
Then, the signals from the K modulators are summed and transmitted over the channel. 

At the receiver of an FDM system, the signals are usually separated by passing 
through a parallel bank of bandpass filters (BPFs). There, each filter is tuned to one of 
the carrier frequencies and has a bandwidth that is wide enough to pass the desired signal. 
The output of each bandpass filter is demodulated, and each demodulated signal is fed to a 
lowpass filter that passes the baseband message signal and eliminates the double-frequency 
components. 

FDM is widely used in radio and telephone communications. In telephone commu- 
nications, each voice-message signal occupies anominal bandwidth of 4 kHz. The message 
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Figure 3.31 Frequency-division multiplexing of multiple signals. 


signal is single-sideband modulated for bandwidth-efficient transmission. In the first level 
of multiplexing, 12 signals are stacked in frequency, with a frequency separation of 4 kHz 
between adjacent carriers. Thus, a composite 48-kHz channel, called a group channel, 
transmits the 12 voice-band signals simultaneously. In the next level of FDM, a num- 
ber of group channels (typically five or six) are stacked together in frequency to form a 
supergroup channel. Then the composite signal is transmitted over the channel. Higher- 
order multiplexing is obtained by combining several supergroup channels. Thus, an FDM 
hierarchy is employed in telephone communication systems. 


z 


3.4.2 Quadrature-Carrier Multiplexing 


Another type of multiplexing allows us to transmit two message signals on the same 
carrier frequency. This type of multiplexing uses two quadrature carriers, A, cos 21 fet 
and Ae sin 27 fet. To elaborate, suppose that mı (t) and m2(t) are two separate message 
signals to be transmitted over the channel. The signal m,(t) amplitude modulates the 
carrier A, cos 21 ft, and the signal m2(t) amplitude modulates the quadrature carrier 
A, sin 21 f,t. The two signals are added together and transmitted over the channel. Hence, 
the transmitted signal is 


u(t) = Aem; (t) cos 27 fet + Aem (t) sin 27 fot. (3.4.1) 


Therefore, each message signal is transmitted by DSB-SC AM. This type of signal multi- 
plexing is called quadrature-carrier multiplexing. Quadrature-carrier multiplexing results 
in a bandwidth-efficient communication system that is comparable in bandwidth efficiency 
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Figure 3.32 Quadrature-carrier multiplexing. 


to SSB AM. Figure 3.32 illustrates the modulation and demodulation of the quadrature- 
carrier multiplexed signals. As shown, a synchronous demodulator is required at the 
receiver to separate and recover the quadrature-carrier-modulated signals. 

Demodulation of mı (t) is done by multiplying u(t) by cos27ı fet and then passing 
the result through a lowpass filter. We have 


u(t) cos 2x fet = Aemı (t) cos? 2nfet + Acem (t) cos 2n fet sin 27 fot 


A A A 
= zm (t) + zm) cos 4m fet + z malt) sin 4x fot. 
This signal has a lowpass component 4m (t) and two high-frequency components. The 
lowpass component can be separated using a lowpass filter. Similarly, to demodule m2(t), 
we can multiply u(t) by sin 21 fet and then pass the product through a lowpass filter. 


3.5 AM RADIO BROADCASTING 


AM radio broadcasting is a familiar form of communication via analog signal transmission. 
Commercial AM radio broadcasting utilizes the frequency band 535-1605 kHz for the 
transmission of voice and music. The carrier-frequency allocations range from 540-1600 
kHz with 10 kHz spacing. 

Radio stations employ conventional AM for signal wansmission. The baseband- 
message signal m(t) is limited to a bandwidth of approximately 5 kHz. Since there are 
billions of receivers and relatively few radio transmitters, the use of conventional AM for 
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Figure 3.33 A superheterodyne receiver. 


broadcast is justified from an economic standpoint. The major objective is to reduce the 
cost of implementing the receiver. 

The receiver most commonly used in AM radio broadcast is the so-called super- 
heterodyne receiver shown in Figure 3.33. It consists of a radio-frequency (RF)-tuned 
amplifier, a mixer, a local oscillator, an intermediate-frequency (IF) amplifier, an envelope 
detector, an audio-frequency amplifier, and a loudspeaker. Tuning for the desired radio fre- 
quency is provided by a variable capacitor, which simultaneously tunes the RF amplifier 
and the frequency of the local oscillator. 

In the superheterodyne receiver, every AM radio signal is converted to a common 
intermediate frequency of fi = 455 kHz. This conversion allows the use of a single-tuned 
IF amplifier for signals from any radio station in the frequency band. The IF amplifier is 
designed to have a bandwidth of 10 kHz, which matches the bandwidth of the transmitted 
signal. 

The frequency conversion to IF is performed by the combination of the RF amplifier 
and the mixer. The frequency of the local oscillator is 


feet se 


where fo is the carrier frequency of the desired AM radio signal. The tuning range of 
the local oscillator is 995—2055 kHz. By tuning the RF amplifier to the frequency f, and 
mixing its output with the local oscillator frequency fio = fe + fir, we obtain two signal 
components; one is centered at the difference frequency fip, and the second is centered at 
the sum frequency 2 fe + fir. Only the first component is passed by the IF amplifier. 

At the input to the RF amplifier, we have signals that are picked up by the antenna 
from all radio stations. By limiting the bandwidth of the RF amplifier to the range Be < 
Bre < 2 fir, where B, is the bandwidth of the AM radio signal (10 kHz), we can reject the 
radio signal transmitted at the so-called image frequency f! = fio + fir. When we mix 
the local oscillator output cos 2v fot with the received signals 


r, (t) = A-[] + m (t)] cos 2x fet 
ro(t) = A-[1 + m2(t)] cos2n fit, 
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where fs = fio — fir and fi = fio + fir, the mixer output consists of the two signals 


yı (t) = A-[1 + mı (t)] cos 2x fir (t) + double-frequency term, 
y2(t) = Ac[1 + m2(t)] cos 2x fr (t) + double-frequency term, 


where m; (t) represents the desired signal and m2(t) is the signal sent by the radio station 
transmitting at the carrier frequency f! = fio + fir. To prevent the signal r2(t) from 
interfering with the demodulation of the desired signal r1(¢), the RF-amplifier bandwidth 
is sufficiently narrow so the image-frequency signal is rejected. Hence, Brr < 2 fip is the 
upper limit on the bandwidth of the RF amplifier. In spite of this constraint, the band- 
width of the RF amplifier is still considerably wider than the bandwidth of the IF amplifier. 
Thus, the IF amplifier, with its narrow bandwidth, provides signal rejection from adjacent 
channels, and the RF amplifier provides signal rejection from image channels. Figure 3.34 
illustrates the bandwidths of both the RF and IF amplifiers and the requirement for rejecting 
the image-frequency signal. 

The output of the IF amplifier is passed through an envelope detector, which produces 
the desired audio-band message signal m(t). Finally, the output of the envelope detector is 
amplified, and this amplified signal drives a loudspeaker. Automatic volume control (AVC) 


— Bee 
0 fr f 
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Bej- 
0 fe fo fc=fiotfr f 


Figure 3.34 Frequency-response 
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is provided by a feedback-control loop, which adjusts the gain of the IF amplifier based on 
the power level of the signal at the envelope detector. 


3.6 SUMMARY AND FURTHER READING 


In this chapter, we covered amplitude modulation (AM) of analog signals. The AM meth- 
ods treated were double-sideband, suppressed carrier (DSB-SC) AM, conventional AM, 
and single-sideband (SSB) AM. The implementation of different types of AM modulators 
and demodulators was also treated. In the final topic of this chapter, we described the use 
of AM in radio broadcasting. 

Amplitude modulation is treated in numerous books on basic communication sys- 
tems, including the books by Shanmugam (1979), Carlson (1986), Stremler (1990), Couch 
(1993), Gibson (1993), Haykin (2000), and Ziemer and Tranter (2002). The implementa- 
tion of analog communications systems is treated in detail by Clark and Hess (1971). 


APPENDIX 3A: DERIVATION OF THE EXPRESSION FOR SSB-AM SIGNALS 


Let m(t) be asignal with the Fourier transform (spectrum) M (f ). An upper single-sideband 
amplitude-modulated (USSB AM) signal is obtained by eliminating the lower sideband 
of a DSB-AM signal. Suppose we eliminate the lower sideband of the DSB-AM signal, 
upsg (t) = 2A-m(t) cos 27 f-t, by passing it through a highpass filter whose transfer func- 
tion is given by 


1, Ifl> fe 


0, otherwise f 


m=] 


as shown in Figure 3.16. Obviously, H (f) can be written as 


H(f) =u(f - fð tuaCf— fo. 


where u_;(-) represents the unit-step function. Therefore, the spectrum of the USSB-AM 
signal is given by 


Uu(f) = AM(f — foju- (f — fe) + AMS + felu-i(-f — fe), 
or equivalently, 
Valf) = AcM(f uP) pay_p, + AM Pu) papa (3A.1) 


Taking the inverse Fourier transform of both sides of Equation (3A.1) and using the mod- 
ulation and convolution properties of the Fourier transform, as shown in Example 2.3.14 
and Equation (2.3.26), we obtain 


uy(t) = Acm(t) x GF" [u_i(f)] ei2nfet 
+ Aom(t)* F7! [u (= f] oie fet (3A.2) 
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Next, we note that 


1 j 
a| 580) + Żl] = u_\(f) 
1 ; , (3A.3) 
Fls- -|= — 
EQ se] anu f) 


which follows from Equation (2.3.12) and the duality theorem of the Fourier wansform. 
Substituting Equation (3A.3) in Equation (3A.2), we obtain 


1 ; 
u,(t) = Acm(t) x [580 + A e27 fet 
1 j ; 
+ Acm(t) «| 280) — LE | eite 
2 2nt 


= a [m(t) + jhe) eA 


A ; 
+ S [m(t) — jra(t)] eie, (3A.4) 
where we have used the identities 
m(t) x d(t) = m(t), 
1 
m(t) x — = Ĥ(t). 
TÉ 
Using Euler’s relations in Equation (3A.4), we obtain 
u,(t) = Acm(t) cos 27 fet — Acm(t) sin 27 fet, (3A.5) 


which is the time-domain representation of a USSB-AM signal. The expression for the 
LSSB-AM signal can be derived by noting that 


Uu (t) + ue(t) = upsg(t) 
or 


Acm(t)cos2n fet — AcM(t) sin 27 fet + uelt) = 2Aem(t) cos 27 fet. 


Therefore, 


ue(t) = Acm(t) cos 2n fet + Acm(t) sin 2r fet. (3A.6) 
Thus, the time-domain representation of a SSB-AM signal can generally be expressed as 
ussp(t) = A-m(t) cos2n fot  A,m(t) sin2n fot, (3A.7) 


where the minus sign corresponds to the USSB-AM signal, and the plus sign corresponds 
to the LSSB-AM signal. 
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PROBLEMS 


3.1 The message signal m(t) = 2 cos 400t + 4 sin(500t + 3) modulates the carrier sig- 
nal c(t) = A cos(8000xż), using DSB amplitude modulation. Find the time-domain 
and frequency-domain representations of the modulated signal and plot the spec- 
trum (Fourier transform) of the modulated signal. What is the power content of the 
modulated signal? 


(33 In a DSB system, the carrier is c(t) = A cos 27 fet and the message signal is given 
by m(t) = sinc(t) + sinc?(t). Find the frequency-domain representation and the 
bandwidth of the modulated signal. 


3.3 The two signals (a) and (b), shown in Figure P-3.3, DSB modulate a carrier signal 
c(t) = Acos2x fot. Precisely plot the resulting modulated signals as a function of 
time and discuss their differences and similarities. 


m(t) j m(t) j 





Figure P-3.3 


3.4 Suppose the signal x (t) = m (t) + cos 27 fet is applied to a nonlinear system whose 
output is y (t) = x (t) + 5x? (t). Determine and sketch the spectrum of y (t) when 
M (f) is as shown in Figure P-3.4 and W < fe. 


M(f) } 





Figure P-3.4 
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3.5 


3.6 


3.7 


3.8 


3.9 
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The modulating signal 
m (t) = 2 cos 4000xt + 5 cos 6000nt 
is multiplied by the carrier 
c (t) = 100 cos 27 fet, 


where fe = 50 kHz. Determine and sketch the spectrum of the DSB signal. 


A DSB-modulated signal u(t) = Am(t) cos 2v f,t is mixed (multiplied) with a local 
carrier x(t) = cos(2n fet + 0), and the output is passed through a lowpass filter 
with a bandwidth equal to the bandwidth of the message m(t). The signal power at 
the output of the lowpass filter is denoted by Poy. The modulated signal power is 
denoted by Py. Plot = as a function of 6 for0 < 0 < x. 





An AM signal has the form 
u (t) = [20 + 2 cos 3000xt + 10 cos 6000rtt] cos 2v fst, 


where f, = 10°Hz. 


1. Sketch the (voltage) spectrum of u(t). 
2. Determine the power in each of the frequency components. 
3. Determine the modulation index. 


4. Determine the sidebands’ power, the total power, and the ratio of the sidebands’ 
power to the total power. 


A message signal m (t) = cos 2000nt + 2 cos 4000t modulates the carrier c (t) = 
100 cos 2x fet, where fe = 1 MHz to produce the DSB signal m (t) c (t). 


1. Determine the expression for the upper-sideband (USB) signal. 


2. Determine and sketch the spectrum of the USB signal. 


A DSB-SC signal is generated by multiplying the message signal m (t) with the 
periodic rectangular waveform (shown in Figure P-3.9), then filtering the product 
with a bandpass filter tuned to the reciprocal of the period Tp, with the bandwidth 
2W, where W is the bandwidth of the message signal. Demonstrate that the output 
u (t) of the bandpass filter (BPF) is the desired DSB-SC AM signal 


u (t) = m (t) sin 27 fot, 


where fe = 1/Tp. 
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Figure P-3.9 


3.10 Show that while generating a DSB-SC signal in Problem 3.9, it is not necessary for 
the periodic signal to be rectangular. This means that any periodic signal with the 
period T, can substitute for the rectangular signal in Figure P-3.9. 


3.11 The message signal m(t) has the Fourier transform shown in Figure P-3.1 1(a). This 
signal is applied to the system shown in Figure P-3.11(b) to generate the signal y(t). 
1. Plot Y (f), the Fourier transform of y(t). 


2. Show that if y(t) is transmitted, the receiver can pass it through a replica of 
the system shown in Figure P-3.11(b) to obtain m(t) back. This means that this 
system can be used as a simple scrambler to enhance communication privacy. 


M(f) } 





(a) 


m(t) | LPF | 


oe faa =f (wend W, w] i. 


Acos2nf,t A cos2x( f, + W)t 


(b) Figure P-3.11 
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3.12 Show that in a DSB-modulated signal, the envelope of the resulting bandpass signal 
is proportional to the absolute value of the message signal. This means that an enve- 
lope detector can be employed as a DSB demodulator if we know that the message 
signal is always positive. 


3.13 An AM signal is generated by modulating the carrier fe = 800 kHz by the signal 
m (t) = sin 2000nt + 5 cos 4000nt. 


| The AM signal 
u(t) = 100[1 +m (t)] cos 2x fot 


! is fed to a 50-2 load. 


1. Determine and sketch the spectrum of the AM signal. 
2. Determine the average power in the carrier and in the sidebands. 
3. What is the modulation index? 


4. What is the peak power delivered to the load? 
3.14 The output signal from an AM modulator is 
u (t) = 5 cos 1800nt + 20cos 2000nt + 5 cos 2200nt. 


1. Determine the modulating signal m (t) and the carrier c (t). 
2. Determine the modulation index. 
| 


i 3. Determine the ratio of the power in the sidebands to the power in the carrier. 
3.15 A DSB-SC AM signal is modulated by the signal 
| m (t) = 2 cos 2000xt + cos 6000nt. 
The modulated signal is 
u(t) = 100m (t) cos 27 fit, 
where fe = 1 MHz. 





1. Determine and sketch the spectrum of the AM signal. 


2. Determine the average power in the frequency components. 
3.16 An SSB-AM signal is generated by modulating an 800 kHz carrier by the signal 
m (t) = cos 2000nt + 2 sin 2000xt. The amplitude of the carrier is A, = 100. 


1. Determine the signal m (t). 


2. Determine the (time-domain) expression for the lower sideband of the SSB-AM 
signal. 


3. Determine the magnitude spectrum of the lower-sideband-SSB signal. 
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3.17 Weaver’s SSB modulator is illustrated in Figure P-3.17. By taking the input signal as 


m (t) = cos 21 fmt where fm < W, demonstrate that by proper choice of f and fo, 
the output is an SSB signal. 


m(t) 
—_—- 





Figure P-3.17 


3.18 The message signal m(t), whose spectrum is shown in Figure P-3.18, is passed 
through the system shown in that figure. 


cos (2 nf?) 


m(i) x(t) Square law yi) =x) Bandpass yal?) ys(?) Lowpass yale) 
device filter f Iter 


cos (21 fot) M(f) 





Figure P-3.18 


The bandpass filter has a bandwidth of 2W centered at fo, and the lowpass filter has 
a bandwidth of W. Plot the spectra of the signals x(t), yı (t), ya(t), y3(t), and y4(t). 
What are the bandwidths of these signals? 
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3.19 The system shown in Figure P-3.19 is used to generate an AM signal. The modulat- 
ing signal m (t) has zero mean and its maximum (absolute) value is Am =max |m (t) |. 
The nonlinear device has the input-output characteristic 


y (t) = ax (t) + bx? (t). 


1. Express y (t) in terms of the modulating signal m (t) and the carrier c (t) = 
cos2n fet. 
2. What is the modulation index? 


3. Specify the filter characteristics that yield an AM signal at its output. 


m(t) x(t) | Nonlinear | y(s) wt) 
memoryless |——> ee 
system ilter AM signal 


c(t) = cos (2nfo¢) Figure P-3.19 


3.20 The signal m(t), whose Fourier transform M (f) is shown in Figure P-3.20, is to be 


transmitted from point A to point B. We know that the signal is normalized, meaning 
that —1 < m(t) < 1. 


MCPA 





—10,000 





10,000 
Figure P-3.20 


1. If USSB is employed, what is the bandwidth of the modulated signal? 

2. If DSB is employed, what is the bandwidth of the modulated signal? 

3. If an AM scheme with a = 0.8 is used, what is the bandwidth of the modulated 
signal? 


3.21 A vestigial-sideband modulation system is shown in Figure P-3.21. The bandwidth 


of the message signal m(t) is W, and the transfer function of the bandpass filter is 
shown in the figure. 
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1. Determine A; (t), which is the lowpass equivalent of h(t), where h(t) represents 


the impulse response of the bandpass filter. 


2. Derive an expression for the modulated signal u(t). 


mÀ BPF VSB signal 
H(f) u(t) 


A, cos (21f,0) 








f+ pw ff 


Figure P-3.21 


3.22 Find expressions for the in-phase and quadrature components, x-(t) and x;(t), as 
well as the envelope and phase, V(t) and @(t), for DSB, SSB, conventional AM, 
USSB, and lower SSB (LSSB). 


3.23 The normalized signal m, (t) has a bandwidth of 10,000 Hz, and its power content is 
0.5 Watts. The carrier A cos 21 fot has a power content of 200 Watts. 


1. If m,(t) modulates the carrier using SSB amplitude modulation, what will be 
the bandwidth and the power content of the modulated signal? 


2. If the modulation scheme is DSB SC, what is the answer to Part 1? 


3. If the modulation scheme is AM with a modulation index of 0.6, what is the 
answer to Part 1? 


3.24 We wish to transmit 60 voice-band signals by SSB (upper-sideband) modulation and 
frequency-division multiplexing (FDM). Each of the 60 signals has a spectrum as 
shown in Figure P-3.24. Note that the voiceband signal is band limited to 3 kHz. 
If each signal is frequency translated separately, we require a frequency synthesizer 
that produces 60 carrier frequencies to perform the FDM. On the other hand, if we 
subdivide the channels into L groups of K subchannels each, such that LK = 60, 
we may reduce the number of frequencies from the synthesizer to L + K. 


1. Illustrate the spectrum of the SSB signals in a group of K subchannels. Assume 
that a 1 kHz guard band separates the signals in adjacent frequency subchannels 
and that the carrier frequencies are fe, = 10 kHz, Jao = 14 kHz, ..., etc. 
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2. Sketch L and K suchthat LK = 60 and L + K is a minimum. 
3. Determine the frequencies of the carriers if the 60 FDM signals occupy the 


frequency band 300-540 kHz, and each group of K signals occupies the band 
10 kHz to (10 + 4K) kHz. 


IMCA) i 


—3000 0 3000 f 


Figure P-3.24 


COMPUTER PROBLEMS 


3.1 Double-sideband (DSB) AM 
The message signal m(t) is given by 


_ f sinc(100t), O<t < to 
m(t) = | 0 otherwise 


> 


where fo = 0.1. The message signal modulates the carrier c(t) = cos 27t fet, where 
Je = 250 Hz, to produce a DSB-AM signal u(t). 


1. By selecting the sampling interval ts = 0.0001, generate samples of m(t) and 
u(t) for O < t < to and plot them. 
2. Determine and plot the spectra of m (t) and u(t). 
3. Repeat Parts 1 and 2 when fo = 0.4, and comment on the results between 
to = 0.1 and to = 0.4. 
3.2 Conventional AM 


The message signal m(t), which is given in Problem CP-3.1, modulates the carrier 
c(t) = cos 27 fet using conventional AM. The carrier frequency is fe = 250 Hz and 
the modulation index is a = 0.80. 


1. Plot the message signal m(t) and the modulated signal u(t) using a sampling 
interval t, = 0.0001. 


2. Determine and plot the spectra of the message signal m(t) and the modulated 
signal u(t). 
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3. Repeat Parts 1 and 2 when tọ = 0.4, and comment on the results between tọ = 
0.1 and to = 0.4. 


3.3 Single-sideband AM 


The message signal m(t), which is given in Problem CP-3.1, modulates the carrier 


c(t) = cos21 fst and produces the lower SSB signal u(t). The carrier frequency is 
fe = 250 Hz. 


1. Plot the message signal m(t), its Hilbert transform (t), and the modulated 
LSSB signal u(t). 


2. Determine and plot the spectra of the message signal m(t) and the modulated 
LSSB signal u(t). 


3. Repeat Parts 1 and 2 when t = 0.4, and comment on the results between 
to = 0.1 and tp = 0.4. 


3.4 Demodulation of the DSB-AM Signal 


The message signal m(t), which is given in Problem CP-3.1, modulates the carrier 
c(t) = cos2nf,t and results in the DSB-AM signal u(t) = m(t)c(t). The carrier 
frequency fe = 250 Hz and tọ = 0.1. 


1. By selecting the sampling interval t = 0.0001, generate 1000 samples of the 
message signal m(t) and the modulated signal u(t), and plot both signals. 


2. Demodulate the sampled DSB-AM signal u(t) generated in Part 1 by using the 
demodulator shown in Figure CP-3.4. Perform the demodulation for ¢ = 0, 
1/8, 1/4, and 1/2, and plot the received message signal m,(t). The lowpass 
filter is a linear-phase FIR filter having 31 taps, a cutoff frequency (—3 dB) of 
100 Hz, and a stopband attenuation of at least 30 dB. 


3. Comment on the results obtained in Part 2. 


4. Instead of using a time-domain lowpass filter to reject the frequency compo- 
nents centered at 2 fe, compute the discrete-Fourier transform (DFT) of 1000 
samples of the mixer output, set to zero those frequency components centered 
at 2 fe and compute the inverse DFT to obtain the time-domain signal. Compare 
the results of this frequency-domain filtering with the time-domain filtering in 


Part 2. 
i m,(t) 
Received C 4; ` | poe Output 
signa | message signal 
Figure CP-3.4 Demodulation 
Cos (2nf,t + ) 


for DSB-SC AM signal. 
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3.5 Demodulation of the SSB-AM Signal 


The message signal m(t), which is given in Problem CP-3.1, modulates the carrier 
c(t) = cos 27 fet and produces the lower SSB signal u(t). The carrier frequency is 
fe = 250 Hz. 


1. By selecting the sampling interval t = 0.0001, generate 1000 samples of the 
message signal m(t), its Hilbert transform ñ (t), and the modulated LSSB signal 
u(t). Plot these three signals. 


2. Demodulate the sampled LSSB signal u(t) generated in Part 1 by using the 
demodulator shown in Figure CP-3.4. Perform the demodulation for ¢ = 0, 
1/8, 1/4, and m/2, and plot the demodulated received message signal m, (t). 
The characteristics of the lowpass filter to be designed are given in Part 2 of 
Problem CP-3.4. 


3. Comment on the results obtained in Part 2. 

4. Instead of using the time-domain filter in Part 2, suppose the filtering of the 
frequency components centered at 2 f, is performed in the frequency domain by 
using the DFT as described in Part 4 of Problem CP-3.4. Perform the filtering in 


the frequency domain and compare the demodulated signal with that obtained 
by time-domain filtering. 


3.6 Demodulation of Conventional AM 


The message signal m(t), which is given in Problem CP-3.1, modulates the carrier 
c(t) = cos 2x fet to produce a conventional AM signal. The carrier frequency is 
fe = 250 Hz and the modulation index is a = 0.80. 


1. By selecting the sampling interval t = 0.0001, generate 1000 samples of the 
message signal m(t) and the modulated conventional AM signal u(t). Plot these 
two signals. 


2. Demodulate the sampled conventional AM signal u(t) generated in Part 1 by 
computing the envelope of u(t), i.e., by computing 


e(t)=V[1+am(t)? = |1 +am(t)| 


and subtracting the DC value term to obtain the demodulated signal m, (t). Plot 
the demodulated received-message signal m, (t). 


3. Comment on the results obtained in Part 2. 










































































































































































































































































































































































































































































Angle Modulation 7 









































































































































































































































































































































































































































































































































In Chapter 3, we considered amplitude modulation (AM) of the carrier as a means for 
transmitting the message signal. Amplitude-modulation methods are also called linear- 
modulation methods, although conventional AM is not linear in the strict sense. 

Another class of modulation methods includes frequency modulation (FM) and phase 
modulation (PM), which are described in this chapter. In FM systems, the frequency of the 
carrier f, is changed by the message signal; in PM systems, the phase of the carrier is 
changed according to the variations in the message signal. Both FM and PM are nonlin- 
ear, and often they are jointly called angle-modulation methods. In the following sections, 
we will show that, angle modulation, due to its inherent nonlinearity, is more complex 
to implement and much more difficult to analyze. In many cases, only an approximate 
analysis can be done. Another property of angle modulation is its bandwidth-expansion 
property. The FM and PM systems generally expand the bandwidth such that the effective 
bandwidth of the modulated signal is usually many times the bandwidth of the message 
signal.! With a higher implementation complexity and a higher bandwidth occupancy, we 
would naturally question the usefulness of these systems. As our analysis in Chapter 6 
will show, the major benefit of these systems is their high degree of noise immunity. In 
fact, these systems sacrifice bandwidth for high-noise immunity. That is the reason that 
FM systems are widely used in high-fidelity music broadcasting and point-to-point com- 
munication systems, where the transmitter power is quite limited. Another advantage of 
angle-modulated signals is their constant envelope, which is beneficial when the signal is 
amplified by nonlinear amplifiers. 


4.1 REPRESENTATION OF FM AND PM SIGNALS 


An angle-modulated signal generally can be written as 


u(t) = Ac cos(2x fet + b(t)), (4.1.1) 


'Swictly speaking, the bandwidth of the modulated signal is infinite. That is why we talk about the effective 
bandwidth. 
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where f, denotes the carrier frequency and ¢ (t) denotes a time-varying phase. The instan- 
taneous frequency of this signal is given by 


1 d 
(t) = —— e(t). 4.1.2 
fil) fot 5G 9) (4.1.2) 
If m(t) is the message signal, then in a PM system, the phase is proportional to the mes- 
sage, i.e., 


b(t) =kpm(t), (4.1.3) 


and in an FM system, the instantaneous frequency deviation from the carrier frequency is 
proportional to the message signal, i.e., 


1 d 
i(t)— =k t) = —— 4.1.4 
Sit) — fe = kpm(t) PERAS (4.1.4) 
where kp and ky are phase and frequency deviation constants. From the preceding rela- 
tionships, we have 


kpm(t), PM 


ot) = f . 
2ks f o m(t)dt, FM 


(4.1.5) 


The foregoing expression shows a close and interesting relationship between FM and PM 
systems. This close relationship allows us to analyze these systems in parallel and only 
emphasize their main differences. First, note that if we phase modulate the carrier with the 
integral of a message, it is equivalent to the frequency modulation of the carrier with the 
original message. On the other hand, this relation can be expressed as 


d 
kp 0), PM 


2nxkşm(t), FM 


d 
< $(0) = (4.1.6) 


which shows that if we frequency modulate the carrier with the derivative of a message, the 
result is equivalent to the phase modulation of the carrier with the message itself. Figure 4.1 
shows the above relation between FM and PM. Figure 4.2 illuswates a square-wave signal 
and its integral, a sawtooth signal, and their corresponding FM and PM signals. 

The demodulation of an FM signal involves finding the instantaneous frequency of 
the modulated signal and then subwacting the carrier frequency from it. In the demodula- 
tion of PM, the demodulation process is done by finding the phase of the signal and then 
recovering m (t). The maximum phase deviation in a PM system is given by 


Admax = kp max[|m(t)]], (4.1.7) 
andthe maximum frequency deviation in an FM system is given by 


Afmax = kf max[|m(t)|]. (4.1.8) 
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Example 4.1.1 
The message signal 


m(t) = acos(20 fmt) 


is used to either frequency modulate or phase modulate the carrier A, cos(2n ft). Find the 
modulated signal in each case. 


Solution In PM, we have 


Q(t) =kpm(t) = kpa cos(2n fmt), (4.1.9) 
and in FM, we have 
b(t) = 27ks [ m(t)dt = se sin(2n fmt). (4.1.10) 


Therefore, the modulated signals will be 


A, cos (27 ft + kpa cos(27 fmt)), PM 
u(t) = ( m s ) . (4.1.11) 
A; cos (2x fat ta sin(27 fint) , FM 
By defining 
Bp = kpa (4.1.12) 
and 
kya 
=, (4.1.13) 
comes? 
we have 
we A, COs (2x fet + bp cos(27 fmt)) , PM (4.1.14) 


A, cos (27 fet + By sin(2fmt)), FM” 


The parameters 6, and By are called the modulation indices of the PM and FM systems, 
respectively. = 


We can extend the definition of the modulation index for a general nonsinusoidal 
signal m(t) as 


Bp = kp max[|m(¢)|]; (4.1.15) 
_ kg max[|m(t)]] 
By = wt (4.1.16) 


where W denotes the bandwidth of the message signal m(t). In terms of the maximum 
phase and frequency deviation Admax and Afmax, we have 


Bp = Ama; (4.1.17) 


A fmax 
bBr= faas, (4.1.18) 
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Narrowband Angle Modulation”. Consider an angle-modulation system in 
which the deviation constants k, and ky and the message signal m(t) are such that for 
all t, we have ¢(t) < 1. Then we can use a simple approximation to expand u(t) in 
Equation (4.1.1) as 


u(t) = Ac cos 2v fst cos ġ (t) — Ac sin 27 fet sin p(t) 
x A cos 2n fet — Ach (t) sin 27 fet, (4.1.19) 


where we have used the approximations cos ø (t) © 1 and sing (t) © P(t) foro (t) « 1. 
Equation (4.1.19) shows that in this case, the modulated signal is very similar to a con- 
ventional AM signal given in Equation (3.2.5). The only difference is that the message 
signal m(t) is modulated on a sine carrier rather than a cosine carrier. The bandwidth of 
this signal is similar to the bandwidth of a conventional AM signal, which is twice the 
bandwidth of the message signal. Of course, this bandwidth is only an approximation of 
the real bandwidth of the FM signal. Phasor diagrams for this signal and the compara- 
ble conventional AM signal are given in Figure 4.3. Compared to conventional AM, the 
narrowband angle-modulation scheme has far less amplitude variations. Of course, the 
angle-modulation system has constant amplitude and, hence, there should be no amplitude 
variations in the phasor-diagram representation of the system. These slight variations are 
due to the first-order approximation that we have used for the expansions of sin(¢ (t)) and 
cos(f(t)). As we will see later, the narrowband angle-modulation method does not pro- 
vide better noise immunity than a conventional AM system. Therefore, narrowband angle 
modulation is seldom used in practice for communication purposes. However, these sys- 
tems can be used as an intermediate stage for the generation of wideband angle-modulated 
signals, as we will discuss in Section 4.3. 


—sin 
| Vt) 
l A. [Ac am(t) Al COs 


(a) 








—sin Vv.) 
Ap (D 
A cos 
k c —| Figure 4.3 Phasor diagrams for the 
conventional AM and narrowband angle 
(b) modulation. 


? Also known as low-index angle modulation. 
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4.2 SPECTRAL CHARACTERISTICS OF ANGLE-MODULATED SIGNALS 


Due to the inherent nonlinearity of angle-modulation systems, the precise characterization 
of their spectral properties, even for simple message signals, is mathematically intractable. 
Therefore, the derivation of the spectral characteristics of these signals usually involves 
the study of simple modulating signals and certain approximations. Then the results are 
generalized to the more complicated messages. We will study the spectral characteristics 
of an angle-modulated signal when the modulating signal is a sinusoidal signal. 


4.2.1 Angle Modulation by a Sinusoidal Signal 


Consider the case where the message signal is a sinusoidal signal (to be more precise, sine 
in PM and cosine in FM). As we have seen in Example 4.1.1, in this case for both FM and 
PM we have 

u(t) = A,cos(2n fet + Bsin2nfint), (4.2.1) 


where £ is the modulation index that can be either B, or By, and in PM sin2nfmt is 
substituted by cos 2x fmt. Using Euler’s relation, the modulated signal can be written as 


u(t) = Re (Aef eib inn fmt) (4.2.2) 
Since sin 21 ft is periodic with period Tm = = the same is true for the complex expo- 
nential signal 
eb sin 2x fmt 


Therefore, it can be expanded in a Fourier-series representation. The Fourier-series coeffi- 
cients are obtained from the integral 


1 
a fig | e 
0 


2 
u=2T fmt =l Š eB sinu—nu) du. (4.2.3) 
21 0 


This latter expression is a well-known integral called the Bessel function of the first kind 
of order n and is denoted by J, (8). Therefore, we have the Fourier series for the complex 


exponential as 
co 


eb inafnt = So Ibe. (4.2.4) 


n=—00 
By substituting Equation (4.2.4) into Equation (4.2.2), we obtain 


u(t) = Re (4 y Iper int ernt) 


n=—00 


= )> AcJn(B) cos (27 (fe +nfm)t). (4.2.5; 


n=—00 
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The preceding relation shows that, even in this very simple case where the modulating sig- 
nal is a sinusoid of frequency fm, the angle-modulated signal contains all frequencies of 
the form fe + nfm forn = 0, +1, +2,.... Therefore, the actual bandwidth of the modu- 
lated signal is infinite. However, the amplitude of the sinusoidal components of frequencies 
fc nfm for largen is very small. Hence, we can define a finite effective bandwidth for the 
modulated signal. For small 8, we can use the approximation 
B” 

2n! 

Thus, for a small modulation index £, only the first sideband corresponding to n = 1 
is important. Also, we can easily verify the following symmetry properties of the Bessel 
function: 





J) © (4.2.6) 


Ja (B), n even 


. J-n(b) = J, (B), oid x 


(4.2.7) 


Plots of J, (8) for various values of n are given in Figure 4.4. Table 4.1 shows the 
number of harmonics required to include 80%, 90%, and 98% of the total power of the FM 
signals, respectively. 


Plots of Bessel functions J,,(8) 





Figure 4.4 Bessel functions for various values of n. 


TABLE 4.1 REQUIRED NUMBER OF HARMONICS IN FM 


Power (%) p=0.5 p=l p=2 p=5 p=8 Bp =10 p=15 
80 = 1 2 4 7 9 14 
90 1 1 2 5 8 10 15 


98 1 2 3 6 9 11 16 
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Example 4.2.1 
Let the carrier be given by c(t) = 10cos(2nf-t), and let the message signal be cos(20m?). 
Further assume that the message is used to frequency modulate the carrier with kp = 50. 
Find the expression for the modulated signal and determine how many harmonics should be 
selected to contain 99% of the modulated signal power. 


Solution The power content of the carrier signal is given by 


A? 100 
P, = -= = — = 50. 4.2. 
2 7 (4.2.8) 
The modulated signal is represented by 
t 
u(t) = 10 cos (ans + 20k / cos(20nT) ar) 
—00 


= 10 cos (ase + = sin(20%1) 
= 10cos(2nf-t + 5 sin(20m72)). (4.2.9) 


The modulation index is given by Equation (4.1.16) as 


max([|m(t)|] = 


=k 
als ae 


5; (4.2.10) 


therefore, the FM-modulated signal is 


u(t) = Ý AcJ,(B) cos (2x( fe + nfm)t) 


= > 10J,(5) cos (2n( fe + 10n)r). (4.2.11) 


n=—00 

The frequency content of the modulated signal is concentrated at frequencies of the form 
fe + 10n for various n. To make sure that at least 99% of the total power is within the effective 
bandwidth, we must choose a k large enough such that 


n=k 
100J2(5 
D ae > 0.99 x 50. (4.2.12) 


n=—k 


This is a nonlinear equation and its solution (for k) can be found by trial-and-error and by using 
tables of the Bessel functions. In finding the solution to this equation, we must employ the 
symmetry properties of the Bessel function given in Equation (4.2.7). Using these properties, 
we have 


k 
50 E 6)+2> > o] > 49.5. (4.2.13) 


n=1 


Starting with small values of k and increasing it, we see that the smallest value of k for which 
the left-hand side exceeds the right-hand side is k = 6. Therefore, taking frequencies f, + 10k 
for 0 < k < 6 guarantees that 99% of the power of the modulated signal has been included 
and only 1% has been left out. This means that if the modulated signal is passed through an 
ideal bandpass filter centered at fe with a bandwidth of at least 120 Hz, only 1% of the signal 
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B. = 120 Hz i B, = 120 Hz 








yr 


A 





Figure 4.5 The harmonics present inside the effective bandwidth of Example 4.2.1. 


power will be eliminated. This gives us a practical way to define the effective bandwidth of the 
angle-modulated signal as 120 Hz. Figure 4.5 shows the frequencies present in the effective 
bandwidth of the modulated signal. a 


In general, the effective bandwidth of an angle-modulated signal, which contains at 
least 98% of the signal power, is given by the relation 


Be = 2(B + 1) fm, (4.2.14) 


where £ is the modulation index and fmn is the frequency of the sinusoidal message signal. It 
is instructive to study the effect of the amplitude and frequency of the sinusoidal message 
signal on the bandwidth and the number of harmonics in the modulated signal. Let the 
message signal be given by 


m(t) = a cos(2 7T fmt). (4.2.15) 


Using Equations (4.2.14), (4.1.12), and (4.1.13), the bandwidth? of the modulated 
signal is given by 


2(kpa + 1) fn, PM 


Be = UB + 1) fn = (4.2.16) 
ig 2 (“= + 1) Jm; FM 
Sin 
or 
pa eT Dn EM, (4.2.17) 
2(kja + fm), FM 


The preceding relation shows that increasing a, the amplitude of the modulating signal, 
in PM and FM has almost the same effect on increasing the bandwidth Be. On the other 
hand, increasing fm, the frequency of the message signal, has a more profound effect in 
increasing the bandwidth of a PM signal as compared to an FM signal. In both PM and FM, 
the bandwidth Be increases by increasing fm; but in PM, this increase is a proportional 
increase, and in FM, this is only an additive increase which usually (for large £) is not 


3For the remainder of the text, “bandwidth” refers to the effective bandwidth, unless otherwise stated. 
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Figure 4.6 The effect of doubling the bandwidth (frequency) of the message in FM and PM. 


substantial. Now if we look at the number of harmonics in the bandwidth (including the 
carrier) and denote it by Me, we have 


2|kpa|+3, PM 


Me =2([6) + D +1=218)+3= kya 
| >|“ | +3, FM 


m 


(4.2.18) 


In both cases, increasing the amplitude a increases the number of harmonics in the band- 
width of the modulated signal. However, increasing fm has no effect on the number of har- 
monics in the bandwidth of the PM signal, and it almost linearly decreases the number of 
harmonics in the FM signal. This explains the relative insensitivity of the FM-signal band- 
width to the message frequency. First, increasing f, decreases the number of harmonics in 
the bandwidth, and at the same time, it increases the spacing between the harmonics. The 
net effect is a slight increase in the bandwidth. In PM, however, the number of harmonics 
remains constant and only the spacing between them increases. Therefore, the net effect is 
a linear increase in bandwidth. Figure 4.6 shows the effect of increasing the frequency of 
the message in both FM and PM. 


4.2.2 Angle Modulation by an Arbitrary Message Signal 


The spectral characteristics of an angle-modulated signal for a general message signal m(t) 
is quite involved due to the nonlinear nature of the modulation process. However, there 
exists an approximate relation for the effective bandwidth of the modulated signal. This is 
known as Carson’s rule and is given by 


B: = 2(B + 1)W, (4.2.19) 
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where £ is the modulation index defined as 


kp max[|m(t)|], PM 


B= k,max{|m(t)l] as (4.2.20) 


and W is the bandwidth of the message signal m(t). Since wideband FM has a £ witha 
value that is usually around 5 or more, the bandwidth of an angle-modulated signal is much 
greater than the bandwidth of various amplitude-modulation schemes. This bandwidth is 
either W (in single sideband) or 2W (in double-sideband (DSB) or conventional AM). 


Example 4.2.2 


Assuming that m(t) = 10sinc(10*t), determine the transmission bandwidth of an FM- 
modulated signal with kr = 4000. 


Solution For FM, we have B, = 2(6 + 1)W. To find W, we have to find the spectrum of 
m(t). We have M(f) = 107311 (1074 f), which shows that m(t) has a bandwidth of 5000 Hz. 
Since the maximum amplitude of m(t) is 10, we have 


B= ky max[|m(t)[] S 4000 x 10 _ 8 
~ w ~ 5000 


and 


B. = 2(8 + 1) x 5000 = 90,000 Hz = 90 kHz. a 


4.3 IMPLEMENTATION OF ANGLE MODULATORS AND DEMODULATORS 


Any modulation and demodulation process involves the generation of new frequencies that 
were not present in the input signal. This is true for both amplitude- and angle-modulation 
systems. Thus, consider a modulator system with the message signal m(t) as the input 
and with the modulated signal u(t) as the output; this system has frequencies in its output 
that were not present in the input. Therefore, a modulator (and demodulator) cannot be 
modeled as a linear time-invariant (LTI) system, because an LTI system cannot produce 
any frequency components in the output that are not present in the input signal. 


Angle Modulators. Angle modulators are generally time-varying and nonlinear 
systems. One method for directly generating an FM signal is to design an oscillator whose 
frequency changes with the input voltage. When the input voltage is zero, the oscillator 
generates a sinusoid with frequency fe; when the input voltage changes, this frequency 
changes accordingly. There are two approaches to designing such an oscillator, usually 
called a VCO or voltage-controlled oscillator. One approach is to use a varactor diode. A 
varactor diode is a capacitor whose capacitance changes with the applied voltage. There- 
fore, if this capacitor is used in the tuned circuit of the oscillator and the message signal is 
applied to it, the frequency of the tuned circuit and the oscillator will change in accordance 
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+ Co Lo To oscillator circuit 
m(t) 
S Figure 4.7 Varactor-diode implementation 


= = eee of an angle modulator. 


with the message signal. Let us assume that the inductance of the inductor in the tuned 
circuit of Figure 4.7 is Lo and the capacitance of the varactor diode is given by 


C(t) = Co + kom(t). (4.3.1) 


= : oe ; 7 1 
When m(t) = 0, the frequency of the tuned circuit is given by fe = on [lata In general, 
fornonzero m(t), we have 

1 


tta/ Lo (Co + kom(t)) 
1 
~ n/L 


fit) = 


+ eme) 


2y eee (4.3.2) 


Ji+2 tome) 


Assuming that , 
0 

= — m(t 1 

€ gm ) < 


and using the approximations 








JIFeES1+ : (4.3.3) 
and 
: ~l-e, (4.3.4) 
l+e 
we obtain 
1 via € 
l+e 2 
Hence, 
Silt) © fe (1 = mo), (4.3.5) 


which is the relation for a frequency-modulated signal. 

A second approach for generating an FM signal is to use a reactance tube. In 
reactance-tube implementation, an inductor whose inductance varies with the applied volt- 
age is employed; the analysis is very similar to the analysis presented for the varactor 
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Figure 4.8 Generation of a narrowband angle-modulated signal. 


diode. Although we described these methods for the generation of FM signals, basically 
the same methods can be applied for the generation of PM signals (see Figure 4.1), due to 
the close relation between FM and PM signals. 

Another approach for generating an angle-modulated signal is to generate a nar- 
rowband angle-modulated signal and then change it to a wideband signal. This method is 
usually known as the indirect method for the generation of FM and PM signals. Due to the 
similarity of conventional AM signals, the generation of narrowband angle-modulated sig- 
nals is straightforward. In fact, any modulator for conventional AM generation can be eas- 
ily modified to generate a narrowband angle-modulated signal. Figure 4.8 shows the block 
diagram of a narrowband angle modulator. Next, we use the narrowband angle-modulated 
signal to generate a wideband angle-modulated signal. Figure 4.9 shows the block diagram 
of such a system. The first stage of this system is to create a narrowband angle modula- 
tor, such as the one shown in Figure 4.8. The narrowband angle-modulated signal enters 
a frequency multiplier which multiplies the instantaneous frequency of the input by some 
constant n. This is usually done by applying the input signal to a nonlinear element and 
then passing its output through a bandpass filter tuned to the desired central frequency. If 
the narrowband modulated signal is represented by 


Uun(t) = A-cos(2n fet + O(t)), (4.3.6) 
Frequency Frequency 
Input | Narrowband fe Frequency nfo BP | Output 
angle modulator xn filter | a 


Local | 
oscillator Frequency 


fio 


Figure 4.9 Indirect generation of angle-modulated signals. 
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the output of the frequency multiplier (which is the output of the bandpass filter) is 
given by 
y(t) = Ac cos(2nnfet + nọ (t)). (4.3.7) 


In general, this is a wideband angle-modulated signal. However, there is no guarantee that 
the carrier frequency of this signal, nfe, will be the desired carrier frequency. In the last 
stage, the modulator performs an up/down conversion to shift the modulated signal to the 
desired center frequency. This stage consists of a mixer and a bandpass filter. If the fre- 
quency of the local oscillator of the mixer is f{o and we are using a down converter, the 
final wideband angle-modulated signal is given by 


u(t) = A, cos (27 (nfe — fto)t tnd(t)). (4.3.8) 


Since we can freely choose n and fLo, we can generate any modulation index at any desired 
carrier frequency using this method. 


Angle Demodulators. FM demodulators are implemented by generating an AM 
signal, whose amplitude is proportional to the instantaneous frequency of the FM signal, 
and then using an AM demodulator to recover the message signal. To implement the first 
step, i.e., to transform the FM signal into an AM signal, it is enough to pass the FM signal 
through an LTI system, whose frequency response is approximately a straight line in the 
frequency band of the FM signal. If the frequency response of such a system is given by 


EISES- f) forlf - fel < = (439) 
and if the input to the system is 
u(t) = Ac cos (242 + 20k; f m(t) ar) ; (4.3.10) 
—00 
then the output will be the signal 
Vo(t) = Ac(Vo + kkfm(t)) cos (2x4 + 2k; is m(t) ar) : (4.3.11 
—00 


The next step is to demodulate this signal to obtain Ae (Vo + kk ¢m(t)), from which tht 
message m(t) can be recovered. Figure 4.10 shows a block diagram of these two steps. 

Many circuits can be used to implement the first stage of an FM demodulator, i.e. 
FM to AM conversion. One such candidate is a simple differentiator with 


IHI = xf. (4.3.12 


Output 


FM signal AM signal signal 
FEM to AM lsi AM Figure 4.10 A general FM 
convertor | | demodulator 
dt Aaa demodulator. 
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H(f)| 





Figure 4.11 A tuned circuit used in an 
fe f FM demodulator. 


Another candidate is the rising half of the frequency characteristics of a tuned circuit, 
as shown in Figure 4.11. Such a circuit can be easily implemented, but usually the linear 
region of the frequency characteristic may not be wide enough. To obtain linear charac- 
teristics over a wide range of frequencies, usually two circuits tuned at two frequencies 
fı and fə are connected in a configuration, which is known as a balanced discrimina- 
tor. A balanced discriminator with the corresponding frequency characteristics is shown in 
Figure 4.12. 

These FM-demodulation methods, which transform the FM signal into an AM sig- 
nal, have a bandwidth equal to the channel bandwidth B, occupied by the FM signal. 
Consequently, the noise that is passed by the demodulator is the noise contained within Be. 

A different approach to FM-signal demodulation is to use feedback in the FM demod- 
ulator to narrow the bandwidth of the FM detector and, as will be seen in Chapter 6, to 
reduce the noise power at the output of the demodulator. Figure 4.13 illustrates such a sys- 
tem. In this figure, the FM discriminator is placed in the feedback branch of a feedback 
system that employs a VCO path. 

The bandwidth of the discriminator and the subsequent lowpass filter are designed 
to match the bandwidth of the message signal m(t). The output of the lowpass filter is the 
desired message signal. This type of FM demodulator is called an FM demodulator with 
feedback (FMFB). An altemative to the FMFB demodulator is the use of a phase-locked 
loop (PLL), as shown in Figure 4.14 (PLLs are studied in detail in Section 8.8.1). 

The input to the PLL is the angle-modulated signal (we will neglect the presence of 
noise in this discussion) 


u(t) = A-cos[2n fet + o(t)], (4.3.13) 


where, for FM, 
t 


o(t) = ank; | m(t) dt. (4.3.14) 


The VCO generates a sinusoid of a fixed frequency; in this case, it generates the carrier 
frequency f,, in the absence of an input control voltage. 

Now, suppose that the control voltage to the VCO is the loop filter’s output, denoted 
as v(t). Then, the instantaneous frequency of the VCO is 


fot) = fe + kvv(t), (4.3.15) 
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Figure4.12 A balanced 
discriminator and the corresponding 
frequency response. 
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Figure 4.13 Block diagram of an FMFB demodulator. 
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Figure 4.14 Block diagram of 


a PLL-FM demodulator. 


where k, is a deviation constant with units of Hz/V. Consequently, the VCO output may be 
expressed as 
yy(t) = Ay sin[2n fet + dv(t)], (4.3.16) 


where 


v(t) = 2nk, [ v(t) dt. (4.3.17) 
0 


The phase comparator is basically a multiplier and a filter that rejects the signal 
component centered at 2 fe. Hence, its output may be expressed as 


e(t) = FAA sin[ġ (£) — p(t], (4.3.18) 


where the difference Ø (t) — ¢y(t) = $¢(t) constitutes the phase error. The signal e(t) is 
the input to the loop filter. 
Let us assume that the PLL is in lock position, so the phase error is small. Then, 


sin[ó (t) — pa (t)] © $ (t) — p(t) = ġelt) (4.3.19) 


under this condition, so we may deal with the linearized model of the PLL, shown in 
Figure 4.15. 


$Ò + Loop filter | ve . 


st) | 





t 
2nk,y I v(t) dr 
el Figure 4.15 Linearized PLL. 
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We may express the phase error as 


t 
belt) = $0) — 2k, it v(t) dt, (4.3.20) 
0 
or equivalently, either as 
d d 
nee + 2rk,v(t) = ae (4.3.21) 
or as ? a ; 
nee) + anky | dbe(t)g(t —t) dt = ane: (4.3.22) 
The Fourier transform of the integro-differential equation in Equation (4.3.22) is 
(J2uf)Oo(f) + 2tkyP.( PGF) = (G2 fO(/); (4.3.23) 
hence, 
1 
D(f) z D(f). (4.3.24) 
1+(#)Gy) 


The corresponding equation for the control voltage to the VCO is 


Vf) = ®.(f)G(f) 





= OU) 677: (4.3.25) 
1+(#)G/) 
Now, suppose that we design G(f) such that 
Ks Gh) >1 (4.3.26) 
if 








in the frequency band |f| < W of the message signal. Then, from Equation (4.3.25), we 
have 


j2 
Vf) = Plein, (4.3.27) 
or equivalently, 
1 d 
v(t) = Ink, ne 
= Lit) (4.3.28) 


Since the control voltage of the VCO is proportional to the message signal, v(t) is the 
demodulated signal. 

We observe that the output of the loop filter with the frequency response G(f) is 
the desired message signal. Hence, the bandwidth of G(f) should be the same as the 
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bandwidth W of the message signal. Consequently, the noise at the output of the loop 
filter is also limited to the bandwidth W. On the other hand, the output from the VCO 
is a wideband FM signal with an instantaneous frequency that follows the instantaneous 
frequency of the received FM signal. 

The major benefit of using feedback in FM-signal demodulation is to reduce the 
threshold effect that occurs when the input signal-to-noise-ratio to the FM demodulator 
drops below a critical value. The threshold effect is treated in Chapter 6. 


4.4 FM RADIO BROADCASTING 


Commercial FM radio broadcasting utilizes the frequency band 88—108 MHz for the trans- 
mission of voice and music signals. The carrier frequencies are separated by 200 kHz 
and the peak frequency deviation is fixed at 75 kHz. Preemphasis is generally used, as 
described in Chapter 6, to improve the demodulator performance in the presence of noise 
in the received signal. 

The receiver most commonly used in FM radio broadcasting is a superheterodyne 
type. The block diagram of such a receiver is shown in Figure 4.16. 

As in AM radio reception, common tuning between the RF amplifier and the local 
oscillator allows the mixer to bring all FM radio signals to a common IF bandwidth of 
200 kHz, centered at fp = 10.7 MHz. Since the message signal m(t) is embedded in 
the frequency of the carrier, any amplitude variations in the received signal are a result of 
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Figure 4.16 Block diagram of a superheterodyne FM radio receiver. 
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additive noise and interference. The amplitude limiter removes any amplitude variations in 
the received signal at the output of the IF amplifier by hardlimiting the signal amplitude. 
A bandpass filter, which is centered at fp = 10.7 MHz with a bandwidth of 200 kHz, 
is included in the limiter to remove higher-order frequency components introduced by the 
nonlinearity inherent in the hard limiter. 

A balanced frequency discriminator is used for frequency demodulation. The result- 
ing message signal is then passed to the audio-frequency amplifier, which performs the 
functions of deemphasis and amplification. The output of the audio amplifier is further fil- 
tered by a lowpass filter to remove out-of-band noise, and this output is used to drive a 
loudspeaker. 


FM Stereo Broadcasting. Many FM radio stations transmit music programs in 
stereo by using the outputs of two microphones placed on two different parts of the stage. 
Figure 4.17 shows a block diagram of an FM stereo transmitter. The signals from the left 
and right microphones, m; (t) and m,(t), are added and subtracted as shown. The sum signal 
m(t) +m,(t) is left unchanged and occupies the frequency band 0-15 kHz. The difference 
signal mı(t) — m,(t) is used to AM modulate (DSB-SC) a 38-kHz carrier that is generated 
from a 19-kHz oscillator. A pilot tone at the frequency of 19 kHz is added to the signal for 
the purpose of demodulating the DSB-SC AM signal. We place the pilot tone at 19 kHz 
instead of 38 kHz because the pilot is more easily separated from the composite signal at 
the receiver. The combined signal is used to frequency modulate a carrier. 

By configuring the baseband signal as an FDM signal, a monophonic FM receiver 
can recover the sum signal m; (t) + m,(t) by using a conventional FM demodulator. Hence, 
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Figure 4.17 FM stereo transmitter and signal spacing. 
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Figure 4.18 FM-stereo receiver. 


FM stereo broadcasting is compatible with conventional FM. In addition, the resulting FM 
signal does not exceed the allocated 200-kHz bandwidth. 

The FM demodulator for FM stereo is basically the same as a conventional FM 
demodulator down to the limiter/discriminator. Thus, the received signal is converted to 
baseband. Following the discriminator, the baseband message signal is separated into the 
two signals, mj (t) + m;(t) and m(t) — m,(t), and passed through de-emphasis filters, as 
shown in Figure 4.18. The difference signal is obtained from the DSB-SC signal via a syn- 
chronous demodulator using the pilot tone. By taking the sum and difference of the two 
composite signals, we recover the two signals, mm (t) and m,(t). These audio signals are 
amplified by audio-band amplifiers, and the two outputs drive dual loudspeakers. As indi- 
cated, an FM receiver that is not configured to receive the FM stereo sees only the baseband 
signal mı(t) + m(t) in the frequency range 0-15 kHz. Thus, it produces a monophonic 
output signal that consists of the sum of the signals at the two microphones. 


4.5 SUMMARY AND FURTHER READING 


This chapter covered frequency modulation (FM) and phase modulation (PM) for analog 
signal transmission. The spectral characteristics of FM and PM signals were described, as 
well as their bandwidth occupancy. The implementation of FM and PM modulators and 
demodulators was also covered. In the final topic of this chapter, we described the use of 
FM in radio broadcasting. 

As in the case of amplitude modulation, angle modulation (FM and PM) is treated in 
numerous books on communication systems, including those that were previously cited in 
Section 3.6. 
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PROBLEMS 


4.1 The message signal m(t) = 10sinc(400t) frequency modulates the carrier 
i c(t) = 100 cos 2x f-t. The modulation index is 6. 

1. Write an expression for the modulated signal u(t). 

2. What is the maximum frequency deviation of the modulated signal? 

3. What is the power content of the modulated signal? 

4. Find the bandwidth of the modulated signal. 






4. 2 ‘Signal m(t) is shown in Figure P-4.2; this signal is used once to frequency modulate 
; la carrier and once to phase modulate the same carrier. 








Figure P-4.2 


! 1. Find a relation between k, and ky such that the maximum phase of the modu- 
lated signals in both cases are equal. 








2. Ifkp = fa = 1, what is the maximum instantaneous frequency in each case? 
NE 


4.3 Determine the in-phase and quadrature components as well as the envelope and the 
phase of FM- and PM-modulated signals. 


4.4 An-angle-modulated signal has the form 
u(t) = 100cos [2x fet + 4 sin 20001] , 
where fs = 10 MHz. 


1. Determine the average wansmitted power. 


2. Determine the peak-phase deviation. 





3. Determine the peak-frequency deviation. 
4. Is this an FM or a PM signal? Explain. 








: 4.5 Find the smallest value of the modulation index in an FM system that guarantees 
that all the modulated-signal power is contained in the sidebands and no power is 
transmitted at the carrier frequency when the modulating signal is a sinusoid. 
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4.6 To generate wideband FM, we can first generate a narrowband FM signal, and then 
use frequency multiplication to spread the signal bandwidth. Figure P-4.6 illustrates 
such a scheme, which is called an Armstrong-type FM modulator. The narrowband 
FM signal has a maximum angular deviation of 0.10 radians to keep distortion under 
control. 


1. If the message signal has a bandwidth of 15 kHz and the output frequency from 
the oscillator is 100 kHz, determine the frequency multiplication that is neces- 
sary to generate an FM signal at a carrier frequency of fe = 104 MHz and a 
frequency deviation of f = 75 kHz. 


2. If the carrier frequency forthe wideband FM signal is to be within +2 Hz, deter- 
mine the maximum allowable drift of the 100 kHz oscillator. 


m(t) Narrowband Frequency Wideband FM signal 
FM modulator multipliers 
| | P f, = 104 MHz 


Acos(2nfpt) 


Frequency | 
multipliers | 


fa = 100 kHz 


g 


Figure P-4.6 Armstrong-type FM modulator. 


4.7 Determine the amplitude and phase of the various frequency components of a PM 
signal with k, = 1 and with m(t) a periodic signal given by 


1, 0<t<% 


m(t) = 
-1, B<t<T 


in one period. 


4.8 An FM signal is given as 


t 


u(t) = 100 cos EZ + 100 f 


—00 


m (T) ar] ; 
where m (t) is shown in Figure P-4.8. 


1. Sketch the instantaneous frequency as a function of time. 


2. Determine the peak-frequency deviation. 
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Figure P-4.8 


4.9 The carrier c(t) = 100cos2nf,t is frequency modulated by the signal m(t) = 
5 cos 20,000nt, where f; = 10° Hz. The peak-frequency deviation is 20 kHz. 


1. Determine the amplitude and frequency of all signal components that have a 
power level of at least 10% of the power of the unmodulated carrier component. 


2. From Carson’s rule, determine the approximate bandwidth of the FM signal. 
4.10 The carrier c(t) = Acos2110°t is angle modulated (PM or FM) by the sinu- 


soid signal m(t) = 2cos 2000nt. The deviation constants are kp = 1.5 rad/V and 
k; = 3000 Hz/V. 








1. Determine fp and £p. 


2. Determine the bandwidth in each case using Carson’s rule. 


3. Plot the spectrum of the modulated signal in each case. (Plot only those fre- 
quency components that lie within the bandwidth derived in Part 2.) 


4. If the amplitude of m(t) is decreased by a factor of 2, how would your answers 
to Parts 1-3 change? 


5. If the frequency of m(t) is increased by a factor of 2, how would your answers 
to Parts 1-3 change? 





4.11 The carrier c(t) = 100cos2xf,t is phase modulated by the signal m(t) = 
| 5 cos 2000xż. The PM signal has a peak-phase deviation of x/2. The carrier fre- 
quency is f, = 10° Hz. 


1. Determine the magnitude spectrum of the sinusoidal components and sketch the 
results. 


2. Using Carson’s rule, determine the approximate bandwidth of the PM signal and 
compare the result with the analytical result in Part 1. 
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4.12 An angle-modulated signal has the form 
u(t) = 100 cos [2n fet + 4sin 27 fmt], 
where fe = 10 MHz and fm = 1000 Hz. 


1. Assuming that this is an FM signal, determine the modulation index and the 
transmitted-signal bandwidth. 


2. Repeat Part 1 if fm is doubled. 


3. Assuming that this is a PM signal, determine the modulation index and the 
transmitted-signal bandwidth. 


4. Repeat Part 3 if fn is doubled. 


4.13 Itis easy to demonstrate that amplitude modulation satisfies the superposition princi- 
ple, whereas angle modulation does not. To be specific, let mı (t) and m2 (t) represent 
two message signals and let u; (t) and u2(t) represent the corresponding modulated 
versions. 


1. Show that when the combined message signal m(t) + m2(t) DSB modulates a 
carrier A, cos 27 fet, the result is the sum of the two DSB amplitude-modulated 
signals u; (t) + u2(t). 


2. Show that if mı (t) + m2 (t) frequency modulates a carrier, the modulated signal 
is not equal to u(t) + u2(t). 


4.14 An FM discriminator is shown in Figure P-4.14. The envelope detector is assumed 
to be ideal and has an infinite input impedance. Select the values for L and C if the 
discriminator is to be used to demodulate an FM signal with a carrier fe = 80 MHz 
and a peak frequency deviation of 6 MHz. 






Envelope 
detector 


m(t) 


Figure P-4.14 


4.15 An angle-modulated signal is given as 
u(t) = 100 cos [2000nt + ¢(t)], 


where (a) ¢ (t) = 5 sin 20xt and (b) (t) = 5cos20nt. Determine and sketch the 
amplitude and phase spectra for (a) and (b), and compare the results. 
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4.16 The message signal m(t) into an FM modulator with a peak frequency deviation 
fa = 25 Hz2/V is shown in Figure P-4.16. Plot the frequency deviation in Hz and the 
phase deviation in radians. 


Figure P-4.16 


4.17 A message signal m(t) has a bandwidth of 10 kHz and a peak magnitude |m(t)| 
of 1 volt. Estimate the bandwidth of the signal u(t) obtained when m(t) frequency 
modulates a carrier with a peak-frequency deviation of (a) fa = 10 Hz/V, (b) 100 
Hz/V, and (c) 1000 Hz/V. 


4.18 The modulating signal that is the input into an FM modulator is 
m(t) = 10cos 16nt. 


The output of the FM modulator is 


t 


u(t) = 10cos | 4000% + 20k ¢ J 


—00 


m (T) ar], 


where ky = 10. (See Figure P-4.18.) If the output of the FM modulator is passed 
through an ideal BPF centered at f, = 2000 with a bandwidth of 62 Hz, determine 
the power of the frequency components at the output of the filter. What percentage 
of the transmitter power appears at the output of the BPF? 





FM 
m(t) demodulator Output 
= f, = 2000 Hz > : ae 


kp= 10 





Figure P-4.18 


4.19 The message signal m; (t) is shown in Figure P-4.19 and the message signal m2(t) = 
sinc(2 x 10+) volts. 
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m (Â volts | 





4 
(sec) Figure P-4.19 


1. If m,(t) is frequency modulated on a carrier with frequency 10° Hz and a 
frequency-deviation constant (k f) equal to 5 Hz/V, what is the maximum instan- 
taneous frequency of the modulated signal? 


2. If m (t) is phase modulated with a phase-deviation constant k, = 3 rad/V, what 
is the maximum instantaneous frequency of the modulated signal? What is the 
minimum instantaneous frequency of the modulated signal? 


3. If m2(t) is frequency modulated with kr = 10° Hz/V, what is the maximum 
instantaneous frequency of the modulated signal? What is the bandwidth of the 
modulated signal? 


4.20 A superheterodyne FM receiver operates in the frequency range of 88-108 MHz. 
The IF and local-oscillator frequencies are chosen such that fip < fio. We require 
that the image frequency f! fall outside of the 88-108 MHz region. Determine the 
minimum required fi and the range of variation in fio. 





COMPUTER PROBLEMS 


4.1 Frequency Modulation 


The message signal 
1 O<t<%/3 
m(t) = {—2 to/3 <t < 2to/3 
0 otherwise 


frequency modulates the carrier c(t) = cos 2x fet, where fe = 200 Hz and t = 
0.15 sec. The frequency-deviation constant is kf = 50. Therefore, the frequency- 
modulated signal is 


u(t) = cos (asa + 20k ¢ J m(t) ar) ; 


1. Plot the message signal m(t) and its integral on two separate graphs. The sam- 
pling interval is ts = 0.0001. 
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2. Plot the FM signal u(t). 

3. Use MATLAB’s Fourier-transform routine to compute and plot the spectra of 
m(t) and u(t) on separate plots. 

4. Suppose we define the bandwidth W of m(t) as the width of the main lobe 


of its spectrum. Determine the modulation index 6 and the modulation-signal 
bandwidth Be using Carson’s rule. 


4.2 Frequency Modulation 


The message signal 
sinc(100t) |t| < to 
m(t) = , 
0 otherwise 


frequency modulates the carrier c(t) = cos(2n f-t), when fe = 250 Hzand to = 0.1. 
The frequency-deviation constant is kr = 100. Therefore, the frequency-modulated 
signal is 
t 
u(t) = cos (2x4 + 2k ¢ i m(T) ar) : 
—00 


1. Plot the message signal and its integral on two separate graphs. The sampling 
interval is t; = 0.0001. 

2. Plot the FM signal u(t). 

3. Use MATLAB’s Fourier-transform routine to compute and plot the spectra of 
m(t) and u(t) on separate graphs. 

4. Demodulate the FM signal u(t) to obtain the message signal and compare the 
result with the original message signal. The FM signal can be demodulated by 
first finding the phase of u(t), i.e., the integral of m(t), which can be differenti- 
ated and divided by 21k p to yield m(t). Use MATLAB function unwrap . m to 
undo the effect of 21-phase foldings. Comment on how well the demodulated 
message signal matches the original message signal m (t). 


4.3 Frequency Modulation 


The message signal 


t 0<t<1 
m(t)= ¢-t+2 1<t<2 
0 otherwise 


frequency modulates the carrier c(t) = cos(2nf,t), when fe = 1000 Hz. The 
frequency-deviation constant is k = 25. 


1. Plot the message signal and its integral on two separate graphs. 
2. Plotthe FM signal 


u(t) = cos (2x40 + 20k ¢ [ m(T) az) . 
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3. 


4. 


Use MATLAB’s Fourier-transform routine to compute and plot the spectra of 
m(t) and u(t) on separate graphs. 


Determine the modulation index, the bandwidth, and the range of the instanta- 
neous frequency of u(t). 


. Demodulate the FM signal u(t) to obtain the message signal and compare the 


result with the original message signal. The FM signal can be demodulated by 
first finding the phase of u(t), i.e., the integral of m(t), which can be differenti- 
ated and divided by 21k; to yield m(t). Use the MATLAB function unwrap . m 
to undo the effect of 21-phase foldings. Comment on how well the demodulated 
message signal matches the original message signal m(t). 
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i This chapter is devoted to a review of the basics of probability and the study of random 
| processes and their properties. Random processes provide good models for both informa- 
| tion sources and noise. When a signal is transmitted through a communication channel, 
there are two types of imperfections that cause the received signal to be different from the 
transmitted signal. One class of imperfections is deterministic in nature, such as linear and 
nonlinear distortion, intersymbol interference, and so on. The second class is nondetermin- 
istic, such as the addition of noise, multipath fading, and so on. For a quantitative study of 
these phenomena, we model them as random processes. 

The information that is to be transmitted is best modeled as a random process. This is 
because any signal that conveys information must have some uncertainty in it, otherwise its 
| transmission is of no interest. We will explore this aspect in greater detail in Chapter 12. In 
l this chapter, we will briefly review the basics of probability theory and random variables, 
| then we will introduce the concept of a random process and the basic tools used in analysis 
of random processes. 





5.1 REVIEW OF PROBABILITY AND RANDOM VARIABLES 


I In this section, we will briefly review some basics of probability theory that are needed 
| for our treatment of random processes. Throughout the text, we assume that the reader 
has already been exposed to probability theory elsewhere; therefore, our treatment in this 
section will be brief. 





i 5.1.1 Sample Space, Events, and Probability 









































The fundamental concept in any probabilistic model is the concept of a random experi- 
ment, which is any experiment whose outcome cannot be predicted with certainty. Flipping 
a coin, throwing a die, and drawing a card from a deck of cards are examples of random 
experiments. What is common in all these cases is that the result (or outcome) of the exper- 
iment is uncertain. A random experiment has certain outcomes, which are the elementary 
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results of the experiment. In flipping of a coin, “head” and “tail” are the possible outcomes. 
In throwing a die, 1,2,3,4,5, and 6 are the possible outcomes. The set of all possible out- 
comes is called the sample space and is denoted by &2. Outcomes are denoted by w’s, and 
each w lies in Q, i.e., w € Q. 

A sample space is discrete if the number of its elements are finite or countably infi- 
nite,! otherwise it is a nondiscrete sample space. All the random experiments given above 
have discrete sample spaces. If we randomly choose a number between 0 and 1, then the 
sample space corresponding to this random experiment is the set of all numbers between 0 
and 1, which is infinite and uncountable. Such a sample space is nondiscrete. 

Events are subsets of the sample space; in other words, an event is a collection of 
outcomes. For instance, in throwing a die, the event “the outcome is odd” consists of out- 
comes 1, 3, and 5; the event “the outcome is greater than 3” consists of outcomes 4, 5, and 
6; and the event “the outcome is a multiple of 4” consists of the single outcome 4. For the 
experiment of picking a number between 0 and 1, we can define an event as “the outcome 
is less than 0.7,” or “the outcome is between 0.2 and 0.5,” or “the outcome is 0.5.” Events 
are disjoint if their intersection is empty. For instance, in throwing a die, the events “the 
outcome is odd” and “the outcome is a multiple of 4” are disjoint. 

We define a probability P as a set function assigning nonnegative values to all events 
E such that the following conditions are satisfied: 


1. 0 < P(E) < 1 for all events. 

2. P(Q)=1. 

3. For disjoint events E1, E2, E3, -+ - (i.e., events for which E; N E; = Ø for alli Æ j, 
where Ø is the empty set), we have P(UR E) = } jc, P (Ei). 


Some basic properties of probability follow easily from the set theoretical properties 
of events combined with the above three conditions. Some of the most important properties 
are as follows: 


1. P(E°) = 1 — P(E), where E° denotes the complement of E. 
2. PŒ) =0. 

3. P(E, U E2) = P (E1) + P (E2) — P(E; 2 E2). 

4. If E; C E then P (E1) < P(E2). 


5.1.2 Conditional Probability 


Let us assume that the two events, E; and E2, have probabilities P (E1) and P (E2). If an 
observer knows that the event E2 has occurred, then the probability that event E; will occur 
will not be P(£,) anymore. In fact, the information that the observer receives changes the 


'Countably infinite means infinite but enumerable, i.e., the number of outcomes is infinite but it can be 
put in one-to-one correspondence with the set of natural numbers, i.e., it can be counted. 
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probabilities of various events; thus, new probabilities, called conditional probabilities, are 
defined. The conditional probability of the event E1, given the event Ez, is defined by 


P(E, N E2) 


, P(E 0 
P(E\IE2) =} pe PEP FO | 
0, otherwise 


(5.1.1) 


If it happens that P(E,|E2) = P (E1), then the knowledge of E2 does not change the prob- 
ability of E1. In this case, the events E, and E2 are said to be independent. For independent 
events, P(E, O E2) = P(E))P(E2). 


Example 5.1.1 
In throwing a fair die, the probability of 
A = {The outcome is greater than 3} 
is i 
P(A) = P (4) + P(5) + P (6) = 7 


The probability of 
B = {The outcome is even} 


is 


P(B) = P(2)+ P(4)+ P(6) = > 
In this case, 
P(ANB P(4)+ P(6 2 


2 

If the events {E;}?_, are disjoint and their union is the entire sample space, then 
they make a partition of the sample space Q. Then, if for an event A, we have the condi- 
tional probabilities {P(A|E;)}?_,, P(A) can be obtained by applying the total probability 
theorem stated as 


P(A) = J` P(E) P(AIE}). (5.1.2) 
i=1 ` 


Bayes’s rule gives the conditional probabilities P (E;|A) by the following relation: 


P(E}) P(AIEi) 


Via P(Ej) P(AIE;) 


P(E;|A) = (5.1.3) 


Example 5.1.2 : 
In a certain city, 50% of the population drive to work, 30% take the subway, and 20% take the 
bus. The probability of being late for those who drive is 10%, for those who take the subway 
is 3%, and for those who take the bus is 5%. 


1. What is the probability that an individual in this city will be late for work? 
2. If an individual is late for work, what is the probability that he drove to work? 
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Solution Let D, S, and B denote the events of an individual driving, taking the subway, or 
taking the bus. Then P(D) = 0.5, P(S) = 0.3, and P(B) = 0.2. If L denotes the event of 
being late, then, from the assumptions, we have 





P(L|D) =0.1; 
P(L|S) = 0.03; 
P(L|B) = 0.05. 


1. From the total probability theorem, 
P(L) = P(D)P(L|D) + P(S)P(L|S) + P(B)P(LIB) 
= 0.5 x 0.1 + 0.3 x 0.03 + 0.2 x 0.05 
= 0.069. 








2. Applying Bayes’s rule, we have 





P(DIL) = P(D)P(L|D) 
P(D)P(L|D) + P(S) P(LIS) + P(B)P(L|B) 
_ 0.05 
~ 0.069 
~ 0.725. = 


Example 5.1.3 


In a binary communication system, the input bits transmitted over the channel are either 0 or 1 
with probabilities 0.3 and 0.7, respectively. When a bit is transmitted over the channel, it can 
be either received correctly or incorrectly (due to channel noise). Let us assume that if a 0 is 
transmitted, the probability of it being received in error (i.e., being received as 1) is 0.01, and if 
a 1 is transmitted, the probability of it being received in error (i.e., being received as 0) is 0.1. 


1. What is the probability that the output of this channel is 1? 


2. Assuming we have observed a 1 at the output of this channel, what is the probability that 
the input to the channel was a 1? 


Solution Let X denote the input and Y denote the output. From the problem assumptions, 


we have 
P(X = 0) = 0.3; P(X = 1) = 0.7; 
P(Y =0|X =0) = 0.99; P(Y = 1|X = 0) = 0.01; 
P(Y =0|X = 1) =0.1; P(Y = 1|X = 1) = 0.9. 


1. From the total probability theorem, we have 
P(Y =1)= P(Y =1,X =0)+ P(Y =1,X =1) 
= P(X =0)P(Y = 1|X =0)+ P(X =1)P(Y = 1X = 1) 
=0.3 x 0.01 +0.7 x 0.9 
_= 0.003 + 0.63 
= 0.633. 
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2. From the Bayes rule, we have 
P(X =1)P(Y =1|X = 1) 
P(X =0)P(Y = 1|X =0)+ P(X =1)P(Y = 1|X = 1) 
0.7 x 0.9 

= 03x 0.01+0.7 x 0.9 

0.63 
= 0.633 
= 0.995. C 





P(X =1|Y =1)= 


5.1.3 Random Variables 


A random variable is a mapping from the sample space Q to the set of real numbers. 
In other words, a random variable is an assignment of real numbers to the outcomes of 
a random experiment. A schematic diagram representing a random variable is given in 
Figure 5.1. 


Example 5.1.4 
In throwing dice, the player wins the amount that the die shows if the result is even and loses 
that amount if it is odd; then the random variable denoting his gain is 


Xo) = w w=2,4,6. k 
—w w=1,3,5 
Random variables are denoted by capital letters, i.e., X, Y, and so on, and individual 
values of the random variable X are X (œw). A random variable is discrete if the range of 
its values is either finite or countably infinite. This range is usually denoted by {x;}. A 
continuous random variable is one in which the range of values is a continuum. 
The cumulative distribution function or CDF of a random variable X is defined as 


Fx (x) = P{fwa@ EQ: X(@) <x}, 
which can be simply written as 


Fx (x) = P(X < x) 


ne Figure 5.1 A random variable as a 
R 


Xlo)  X(@1) Xos) X(w4) mapping from & to R. 
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and has the following properties: 


. 0< Fx(x) <1. 

. Fx(x) is nondecreasing. 

. limy- Fx(x) = 0 and lim,400 Fx (x) = 1. 

. Fx(x) is continuous from the right, i.e., limgyo F(x + €) = F(x). 
. P(a < X <b) = Fy(b) — Fx(a). 

. P(X =a) = Fx (a) — Fx (a7). 


An BW DN = 


For discrete random variables, F(x) is a staircase function. A random variable is contin- 
uous if Fy (x) is a continuous function. A random variable is mixed if it is neither discrete 
nor continuous. Examples of CDFs for discrete, continuous, and mixed random variables 
are shown in Figures 5.2, 5.3, and 5.4, respectively. 

The probability density function, or PDF, of a continuous random variable X is 
defined as the derivative of its CDF. It is denoted by fx(x), i.e., 


d 
fx(x) = —Fx(x). (5.1.4) 
dx 
The basic properties of PDF are as follows: 


1. fx(x) > 0. 
2. Jz fx(x) dx =1. 


F(x) ì 





> Figure5.2 CDF fora discrete 
x random variable. 





Figure 5.3 CDF fora 
X continuous random variable. 
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srs ty, Figure 5.4 CDF for a mixed 


x random variable. 


3. J? fx(x) dx = Pla < X < b). 
4. In general, P(X € A) = f, fx (x) dx. 


5. Fx(x) = JZ fx) du. 


For discrete random variables, we generally define the probability mass function, or PMF 
which is defined as {p;}, where p; = P(X = x;). Obviously, for all i, we have p; > 0 anc 


di pi =1. 


Important Random Variables. In communications, the most commonly usec 
random variables are the following: 


Bernoulli random variable. This is a discrete random variable taking two values 
1 and 0, with probabilities p and 1 — p. A Bemoulli random variable is a good model fo: 
a binary-data generator. When binary data is transmitted over a communication channel 
some bits are received in error. We can model an error by modulo-2 addition of a 1 to th: 
input bit; thus, we change a 0 into a 1 and a 1 into a 0. Therefore, a Bernoulli randon 
variable can also be employed to model the channel errors. 


Binomial random variable. This is a discrete random variable giving the numbe 
of 1’s in a sequence of n-independent Bemoulli trials. The PMF is given by 


n\ k n—k 
1— , O<k< 
P(X =k)= (o) p) eal 
0, otherwise 


(5.1.5 


This random variable models, e.g., the total number of bits received in error when . 
sequence of n bits is transmitted over a channel with a bit-error probability of p. 


Example 5.1.5 
Assume 10,000 bits are transmitted over a channel in which the error probability is 10-3. Whz 
is the probability that the total number of errors is less than 3? 
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Solution In this example, n = 10,000, p = 0.001, and we are looking for P(X < 3). We 
have 


P(X <3) = P(X =0)+ P(X =1)+ P(X =2) 


10,000 
0 


10,000 
EA a 


o.001% — 0.001) 10000 
0.001 (i — 0001" 


+ (eo Jooo (1 — 0.001)!®000-2 


= 0.0028. a 


Uniform random variable. This is a continuous random variable taking values 
between a and b with equal probabilities for intervals of equal length. The density function 
is given by 

1 
fx(x) = 4 b-a 
0, otherwise 


,a<x<b 


This is a model for continuous random variables whose range is known, but nothing else is 
known about the likelihood of the values that the random variable can assume. For example, 
when the phase of a sinusoid is random, it is usually modeled as a uniform random variable 
between 0 and 27. 


Gaussian or normal random variable. The Gaussian, or normal, random variable 
is a continuous random variable described by the density function 


_ =m)? 


fx(x) = e 2% . (5.1.6) 





1 
/2n0 
There are two parameters involved in the definition of the Gaussian random variable. The 
parameter m is called the mean and can assume any finite value. The parameter o is called 
the standard deviation and can assume any finite and positive value. The square of the 
standard deviation, i.e., «, is called the variance. A Gaussian random variable with mean 
m and variance g? is denoted by Nim, o°). The random variable A’ (0, 1) is usually called 
standard normal. 

The Gaussian random variable is the most important and frequently encountered 
random variable in communications. The reason is that thermal noise, which is the major 
source of noise in communication systems, has a Gaussian distribution. The properties of 
Gaussian noise will be investigated in more detail later in this chapter. 

We have graphed the PDF and PMF of the above random variables. They are given 
in Figures 5.5-5.8. 

Assuming that X is a standard normal random variable, we define the function Q(x) 
as P(X > x). The Q-function is given by the relation 


edt. (5.1.7) 





om) =Px>n= f 
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Figure 5.6 The PMF forthe binomial random variable. 
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Figure5.5 The PMF for the 


* Bernoulli random variable. 





Figure5.7 The PDF for the uniform random 


x variable. 
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fx(x) 


1 
V2n07 





0 Figure 5.8 The PDF for the 
m X Gaussian random variable. 


Figure 5.9 The Q-function as the area 
under the tail of a standard normal random 
variable. 





This function represents the area under the tail of a standard normal random variable, 
as shown in Figure 5.9. From this figure, it is clear that the Q-function is a decreasing 
function. This function is well tabulated and frequently used in analyzing the performance 
of communication systems. It is easy to see that Q(x) satisfies the following relations: 


Q(—x) = 1 — Q(x); 
1 
Q0) = 53 


Q(co) = 0. 
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TABLE 5.1 TABLE OF THE Q-FUNCTION 
0 5.000000e-01 2.4 8.197534e—03 4.8 7.933274e-07 


0.1 4.601722e-01 2.5 6.209665e-03 49 4.791830e-07 
0.2 4.207403e-01 2.6 4.661189e-03 5.0 2.86651 6e-07 
0.3 3.820886e-01 2.7 3.466973e-03 5.1 1.698268e-07 
0.4 3.445783e-01 28 2.55513 1e-03 5.2 9.964437e-06 
0.5 3.085375e-01 2.9 1.865812e-03 5.3 5.790128e-08 
0.6 2.742531e-01 3.0 1.349898e-03 5.4 3.332043e-08 
0.7 2.419637e-01 3.1 9.676035e-04 5.5 1.898956e-08 
0.8 2.118554e-01 3.2 6.871378e-04 5.6 1.071760e-08 
0.9 1.840601e-01 3.3 4.834242e-04 5.7 5.990378e-09 
1.0 1.586553e-01 3.4 3.36929 1e-04 5.8 3.315742e-09 
1.1 1.356661e-01 3.5 2.326291e-04 5.9 1.817507e-09 
1.2 1.150697e-01 3.6 1.591086e-04 6.0 9.865876e-10 
13 9.680049e-02 3.7 1.077997e-04 6.1 5.303426e-10 
1.4 8.075666e-02 3.8 7.234806e-05 6.2 2.823161e—10 
1.5 6.680720e-02 3.9 4.809633e-05 6.3 1.488226e-1C 
1.6 5.479929e-02 4.0 3.167124e-05 6.4 7.168843e-11 
17 4.456546e-02 4.1 2.065752e-05 6.5 4.016001e-11 
1.8 3.593032e-02 42 1.334576e-05 6.6 2.055790e-11 
1.9 2.871656e-02 43 8.539898e-06 6.7 1.042099e-1] 
2.0 2.275013e-02 4.4 5.412542e-06 6.8 5.23095le-12 
2.1 1.786442e-02 4.5 3.397673e-06 6.9 2.600125e-17 
2.2 1.390345e-02 4.6 2.112456e-06 7.0 1.279813e-17 
23 1.0724 11le-02 47 1.300809e-06 


Table 5.1 gives the values of this function for various values of x. Two important uppe: 
bounds on the Q-function are widely used to find bounds on the error probability of variou: 
communication systems. These bounds are given as 








x2 
Q(x) < sc forall x > 0 (5.1.8 
and 
Q(x)<—e"F for allx >0 (5.1.9 
x) < e 2 or all x > 0. els 
V20X 
A frequently used lower bound is 
Q(x) l (1 zee for all 1 (5.1.10 
x) > ——+]e orallx > 1. T 
vV 2nx x? 


A plot of Q(x) and its bounds is given in Figure 5.10. 

For an N (m, o?) random variable, a simple change of variable in the integral thé 
computes P(X > x) results in P(X > x) = Q (==). This gives the so-called ta 
probability in a Gaussian random variable. 
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Q(x) 





Figure 5.10 Bounds on the Q-function. 


Example 5.1.6 


X is a Gaussian random variable with mean 1 and variance 4. Find the probability that X is 
between 5 and 7. 


Solution We have m = 1 and o = V4 = 2. Thus, . 
P(5 < X <7)= P(X >5)— P(X >7) 
5-1 7—1 
=o( Ta z 
=0(2)-93) — 
0.0214. C 


5.1.4 Functions of a Random Variable 


A function of a random variable Y = g(X) is itself a random variable. In general, to find 
the CDF of Y = g(X), we can use the definition of the CDF to obtain 


Fr) = P{w E Q: g(X(@)) < x}. 


In the special case that, for all y, the equation g(x) = y has a countable number of solutions 
{x;}, and for all these solutions, g’(x;) exists and is nonzero, we can find the PDF of the 
random variable Y = g(X) with the following relation: 


O=) pan (5.1.11) 





i 
Example 5.1.7 


Assuming X is a Gaussian random variable with m = 0 and ø = 1, find the probability 
density function of the random variable Y given by Y = aX + b. 
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Solution In this case, g(x) = ax + b; therefore, g'(x) = a. The equation ax + b = y has 
only one solution, which is given by x, = =, Using these results, we obtain 





fx 7) 
fry) = ial 
1 _ 0-b)? 
= e a2 (5.1.12) 
~ 27a? 
It is observed that Y is a Gaussian random variable N (b, a?). E 


Using an approach similar to the one used in the preceding example, we can show 
that if X is N (m, o?), then Y = aX + b is also a Gaussian random variable of the form 


N (am + b, a?o?). 


Example 5.1.8 
Assume X is a N (3, 6) random variable. Find the density function of Y = —2X + 3. 


Solution We know Y is a Gaussian random variable with the mean m = —2 x 3 +3 = —3 
and variance o? = 4 x 6 = 24. Therefore, Y is a N (—3, 24) random variable and 


1 0+3 
fO) = -=e 8 m 
y 48T 
From this example, we arrive at the important conclusion that a linear function of a Gaus- 
sian random variable is itself a Gaussian random variable. 


Statistical Averages. The mean, expected value, or expectation of the random 
variable X is defined as a X 
E(X) = / xf x(x) dx (5.1.13) 
—00 
and is also denoted by my. The expected value of a random variable is a measure of the 
average of the values that the random variable takes in a large number of experiments. 
Note that E (X) is just a real number. In general, the nth moment of a random variable X 
is defined as 


mP E J x" fx) dx. (5.1.14) 
—=00 


The expected value of Y = g (X) is 


[20] 


E(g(X)) =f g(x) fx (x) dx. (5.1.15) 


For discrete random variables, these equations become 


E(X) =) xiP(X =xi) (5.1.16 


and 
E(g(X)) = Do g(x) P(X = x). (5.1.17, 
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In the special case where g(X) = (X — E(X )),? E(g(X)) is called the variance of 
X, which is a measure of the spread of the density function of X. If the variance is small, 
it indicates that the random variable is very concentrated around its mean, and in a sense is 
“less random.” However, if the variance is large, then the random variable is highly spread; 
hence, it is less predictable. The variance is denoted by og and its square root, ox, is called 
the standard deviation. The relation for the variance can also be written as 


o? = E(X?) — (E(X))’. 
For any constant c, the following relations hold: 
1. E(cX) =cE(X). 


2. E(c) =c. 
3. E(X +c) = E(X) +c. 


It is also easy to verify that the variance has the following properties: 


1. VAR(cX) = c2VAR(X). 
2. VAR(c) = 0. 
3. VAR(X +c) = VAR(X). 


For the important random variables we introduced earlier, we have the following relations 
for the mean and the variance: 


Bernoulli random variable. E(X) =p VAR(X) = p(1 — p). 


Binomial random variable. E(X)=np VAR(X) = np(1 — p). 
a+b (b —a)* 
2 








VAR(X) = B` 





Uniform random variable. E(X) = 





Gaussian random variable. E(X) =m VAR(X) =o”. 
5.1.5 Multiple Random Variables 


Let X and Y represent two random variables defined on the same sample space (2. For 
these two random variables, we can define the joint CDF as 


Fx,y (x, y) = P{w E Q: X (w) < x, Y (w) < y}, 


or simply as 
Fy y(x, y) = P(X <x,Y < y). 
The joint PDF denoted as fx,y(x, y) is defined by 


a2 
, Y) = ——F , y). 5.1.18 
fxy&œ, y) EF xy(x, y) ( ) 
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The basic properties of the joint and marginal CDFs and PDFs can be summarized with 
the following relations: 


. Fy(x) = Fx,y(x, &). 

. Fy(y) = Fx,y (©, y). > 
» fx) = [R fxx Œ, y)dy. 

~ fr”) = fe, fxr Œ, y) dx. 

H JS SS fx y(x, y)dxdy = 1. 

- P(X, Y) € A) = JJ a yea fur, y)dx dy. 

© Fy yŒ, y) = JŽ o J?» fx x(u, v) du dv. 


DANN A UN m 


The conditional probability density function of the random variable Y, given that the 
value of the random variable X is equal to x, is denoted by fyx(y|x) and defined as 


Ixy (x, y) 
hokt Rar AOFI (5.1.19) 


0, otherwise 


If the density function after the knowledge of X is the same as the density function before 
the knowledge of X, then the random variables are said to be sidtisiicaky independent. For 
statistically independent random variables, 


fxy@, y) = fx) fro). (5.1.20) 
Example 5.1.9 
Let X be N(3, 4) and Y be N(-2, 6). Assuming X and Y are independent, determine 
Fx, (%, y). 


Solution We have 
f(x, y) = fx(x) fe) 
£ 1 Pee g 1 e- oe 














V8n y 127 
ME nye i 
4nV6 


The expected value of g(X, Y), where g(X, Y) is a function of X and Y, is obtained from 


E(g(X, Y)) = J J A E (5.1.21) 











E(XY) is called the correlation of X and Y. Note that if X and Y are independent, ther 
E(XY) = E(X)E(Y). The covariance of X and Y is defined as 


COV(X, Y) = E(XY) — E(X)E(Y). 
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If COV(X, Y) = 0, i.e., if E(XY) = E(X)E(Y), then X and Y are called uncorrelated 
random variables. The normalized version of the covariance, called the correlation coeffi- 
cient, is denoted by Px and is defined as 


_ COV(X, Y) 
ae Oy Oy f 


Using the Cauchy—Schwartz inequality (see Problem 2.37), we can see that |o, „| < 1 and 
` p = +1 indicates a first-order relationship between X and Y, i.e., a relation of the form 
Y = aX + b. The case p = 1 corresponds to a positive a, and ọ = —1 corresponds to a 
negative a. 

Itis obvious that if X and Y are independent, then COV(X, Y) = 0 and X and Y 
will be uncorrelated. In other words, independence implies lack of correlation. It should be 
noted that lack of correlation does not generally imply independence. That is, the covari- 
ance (or P) might be zero, but the random variables may still be statistically dependent. 

Some properties of the expected value and variance applied to multiple random vari- 
ables are as follows: 


1. E(0; c:Xi) = YO; cG E(X)). 
2. VAR; ciXi) = D; 7 VAR(X)) + D; Dj zi cic; COV(Xi, Xj). 
3. VAR; c:Xi) = 0; c7 VAR(X;), if X; and Xj are uncorrelated fori # j. 


In these relations, the c;’s are constants. 


Example 5.1.10 
Assuming that X is N(3, 4), Y is N(—1, 2), and X and Y are independent, determine the 
covariance of the two random variables Z = X — Y and W = 2X + 3Y. 


Solution We have 
E(Z) = E(X) — E(¥) =3+1=4, 
E(W) = 2E(X) + 3E(Y) =6—3=3, 
E(X?) = VAR(X) + (E(X)? =449 = 13, 
E(Y’) = VAR(Y) + (E(¥Y))? =2 +1 = 3, 
and 
E(XY) = E(X)E(Y) = -3. 
Therefore, 
COV(W, Z) = E(W Z) — E(W)E(Z) 
= E(2X? — 3Y? + XY) — E(Z)E(W) 
=2x13-3x3-3-4x3 


=2 E 
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Multiple Functions of Multiple Random Variables. If we define two functions 
of the random variables X and Y by 


Z = g(X,Y) 
W =h(X, Y)’ 


then the joint CDF and PDF of Z and W can be obtained directly by applying the definition 
of the CDF. However, if it happens that for all z and w, the set of equations 


| g(x, y) =z 
h(x, y) = w 


has a countable number of solutions {(x;, y;)}, and if at these points the determinant of the 
Jacobian matrix 


az az 
Ox dy 
J(x, y) = 
3) ðw ow 
ox dy 
is nonzero, then we have 

Ff (xi, Yi) ‘ 
A = c, 5.1.22 
Faw (es) = > T Fae 


i 


where det J denotes the determinant of the matrix J. 


Example 5.1.11 
The two random variables X and Y are independent and identically distributed, each with i 
Gaussian density function with the mean equal to zero and the variance equal to o°. If thesi 
two random variables denote the coordinates of a point in the plane, find the probability densit: 
function of the magnitude and the phase of that point in polar coordinates. 


Solution First, we have to find the joint probability density function of X and Y. Since 3 
and Y are independent, their joint PDF is the product of their marginal PDFs; i.e., 


fxx Œ, y) = fx(x)fr (y) 


1 lay? 
=e H. (5.1.23 





The magnitude of the point with coordinates (X, Y) in the polar plane is given by V = 
/X? + Y2, and its phase is given by © = arctan z, We begin by deriving the joint probabilit 
density function of V and ©. In this case, g(X, Y) = ~X? + Y? and h(X,Y) = arctan + 
The Jacobian matrix is given by 


x y 
2: 2: 2 2 
I(x, y) =| ¥* = an (5.1.2 


Oy x24 y2 
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The determinant of the Jacobian matrix can be easily determined to be equal to 
1 
/ ‘x2 + y2 


1 
== (5.1.25) 
v 


|det J (x, y)| = 


The set of equations 


[x2 + y? =v 
5.1.26 
he 2=0 ( ) 
has only one solution, which is given by 
x = vcosé 
F 2 usna: (5.1.27) 


Substituting these results into Equation (5.1.22), we obtain the joint probability density func- 
tion of the magnitude and the phase as 


fv,o (v, 0) = vfx y (v cos9, vsin0) 





= e w, (5.1.28) 


To derive the marginal probability density functions for the magnitude and the phase, we have 
to integrate the joint probability density function. To obtain the probability density function of 
the phase, we have 


foto) = | fv,o(v, 9) dv 


1 [oe] v2 
= =Í eer or dv 
2n Jo o? 
1 2 7° 
= 5-|-¢ a | 
27 0 
1 
hes (5.1.29) 


Hence, thephaseis uniformly distributed on [0, 21]. To obtain the marginal probability density 
function for the magnitude, we have 


2m 


fy(v) = fv.o(¥, 0) d0 
0 
v _ 22 
= se 22, (5.1.30) 


This relation holds only for positive v. For negative v, fy (v) = 0. Therefore, 
y2 
eat, v>0 


fv) = fo . (5.1.31) 
0, v <0 
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This probability density function is known as the Rayleigh probability density function, and 
it has widespread applications in the study of the fading communication channels. It is also 
interesting to note that in the preceding example 


fv.o(v, 0) = fv (v) fo(0); (5.1.32) 


therefore, the magnitude and the phase are independent random variables. E 


Jointly Gaussian Random Variables. Jointly Gaussian or binormal random 
variables X and Y are distributed according to a joint PDF of the form 


1 
æ, y) = 
Pay 210102,/1 — p2 


1 E zm}? | =m)? _ 26 — my - ma 
20 — p?) 2 2 , 


oO 1 O. 2 0102 
where 7m, ma, or, and ay are the mean and variances of X and Y, respectively, and p is 
their correlation coefficient. When two random variables X and Y are distributed according 
to a binormal distribution, it can be shown that X and Y are normal random variables and 
the conditional densities f(x|y) and f(y|x) are also Gaussian. This property shows the 
main difference between jointly Gaussian random variables and two random variables that 
each have a Gaussian distribution. 

The definition of two jointly Gaussian random variables can be extended to more 
random variables. For instance, X1, X2, and X3 are jointly Gaussian if any pair of them 
are jointly Gaussian, and the conditional density function of any pair given the third one is 
also jointly Gaussian. 

Here are the main properties of jointly Gaussian random variables: 


x exp| - 


1. If n random variables are jointly Gaussian, any subset of them is also distributed 
according to a jointly Gaussian distribution of the appropriate size. In particular, all 
individual random variables are Gaussian. 

2. Jointly Gaussian random variables are completely characterized by the means of all 
random variables mı, m2, ..., M, and the set of all covariance COV(X;, X ;) for all 
1 <i <nand1 < j < n. These so-called second-order properties completely 
describe the random variables. 

3. Any set of linear combinations of (X1, X2,..., Xn) are themselves jointly Gaussian 
In particular, any linear combination of X;’s is a Gaussian random variable. 

4. Two uncorrelated jointly Gaussian random variables are independent. Therefore, foi 
jointly Gaussian random variables, independence and uncorrelatedness are equiva: 
lent. As previously stated, this is not true for general random variables. 


5.1.6 Sums of Random Variables 


If we have a sequence of random variables (X1, X2, ..., Xn) with basically the same prop: 
erties, then the behavior of their average Y = 1 X; Xi is expected to be “less random’ 
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than each X;. The law of large numbers and the central limit theorem are statements of this 
intuitive fact. 

The law of large numbers (LLN) states that if the sequence of random variables 
X 1, X2,..., Xn are uncorrelated with the same mean mx and variance a2 < œ, then for 
any € > 0, lim,... P(|Y —mx| > €) = 0, where Y = ay X;. This means that the 
average converges (in probability) to the expected value. 

The central limit theorem not only states the convergence of the average to the mean, 
but it also gives some insight into the distribution of the average. This theorem states that if 


X;’s are i.i.d. (independent and identically distributed) random variables which each have 
a mean m and variance o”, then Y = 1 $; X; converges to a M (m, Z . We can gen- 
erally say that the central limit theorem states that the sum of many i.i.d. random variables 
converges to a Gaussian random variable. As we will see later, this theorem explains why 
thermal noise follows a Gaussian distribution. 

This concludes our brief review of the basics of the probability theory. References at 
the end of this chapter provide sources for further study. 





: 5,2. RANDOM PROCESSES: BASIC CONCEPTS 


A random process is the natural extension of random variables when dealing with signals. 
In analyzing communication systems, we basically deal with time-varying signals. In our 
development so far, we have assumed that all the signals are deterministic. In many situa- 
tions, the deterministic assumption on time-varying signals is not a valid assumption, and 
it is more appropriate to model signals as random rather than deterministic functions. One 
such example is the case of thermal noise in electronic circuits. This type of noise is due to 
the random movement of electrons as a result of thermal agitation, therefore, the resulting 
current and voltage can only be described statistically. Another example-is the reflection of 
radio waves from different layers of the ionosphere; they make long-range broadcasting of 
short-wave radio possible. Due to the randomness of these reflections, the received signal 
can again be modeled as a random signal. These two examples show that random signals 
can describe certain phenomena in signal transmission. 

Another situation where modeling by random processes proves useful is in the char- 
acterization of information sources. An information source, such as a speech source, gener- 
ates time-varying signals whose contents are not known in advance. Otherwise there would 
be no need to transmit them. Therefore, random processes also provide a natural way to 
model information sources. 

A random process, or a random signal, can be viewed as a set of possible realizations 
of signal waveforms. The realization of one from the set of possible signals is governed 
by some probabilistic law. This is similar to the definition of random variables where one 
from a set of possible values is realized according to some probabilistic law. The difference 
is that in random processes, we have signals (functions) instead of values (numbers). 


Example 5.2.1 


Assume that we have a signal generator that can generate one of the six possible sinusoids. 
The amplitude of all sinusoids is one, and the phase for all of them is zero, but the frequencies 
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can be 100, 200, ..., 600 Hz. We throw a die, and depending on its outcome, which we denote 
by F, we generate a sinusoid whose frequency is 100 times what the die shows (100F). This 
means that each of the six possible signals will be realized with equal probability. This is 
an example of a random process. This random process can be defined as X (t) = cos(2m x 
100F?). a 


Example 5.2.2 


Assume that we uniformly choose a phase © between 0 and 2x and generate a sinusoid with 
a fixed amplitude and frequency but with a random phase ©. In this case, the random process 
is X(t) = Acos(2n fot + ©), where A and fo denote the fixed amplitude and frequency and 
© denotes the random phase. Some sample functions for this random process are shown in 
Figure 5.11. E 


Example 5.2.3 


The process X (t) is defined by X (t) = X, where X is a random variable uniformly distributed 
on [—1, 1]. In this case, an analytic description of the random process is given. For this ran- 
dom process, each sample is a constant signal. Sample functions of this process are shown ir 
Figure 5.12. . I 


From the preceding examples, we see that corresponding to each outcome w; in a sample 
space Q, there exists a signal x(t; wi). This description is similar to the description of ran- 
dom variables in which a real number is assigned to each outcome w;. Figure 5.13 depict: 


0 = —n/4 


ťa O N w * 


Figure 5.11 Sample functions of the random process given in Example 5.2.2. 











Section 5.2 Random Processes: Basic Concepts i 211 





Figure 5.12 Sample functions of the random 


t process given in Example 5.2.3. 


this general view of random processes. Thus, for each w;, there exists a deterministic time 
function x(t; @;), which is called a sample function or a realization of the random process. 
At each time instant tọ and for each w; € Q, we have the number x (to; wi). For the differ- 
ent outcomes (w;’s) at a fixed time fo, the numbers x (to; w;) constitute a random variable 
denoted by X (to). After all, a random variable is nothing but an assignment of real numbers 
to the outcomes of a random experiment. This is a very important observation and a bridge 
that connects the concept of a random process to the more familiar concept of a random 
variable. In other words, at any time instant, the value of a random process is a random 
variable. 


Example 5.2.4 
In Example 5.2.1, determine the values of the random variable X (0.001). ; 
Solution The possible values are cos(0.27), cos(0.41), ... , cos(1.27) and each has a prob- 
ability 2. a 
Example 5.2.5 


Let Q denote the sample space corresponding to the random experiment of throwing a die. 
Obviously, in this case Q = {1, 2, 3, 4, 5, 6}. For all w;, let x(t; w;) = w;e™u_ (t) denote a 
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x(t; w) 


~ 


x (t; @) 


~ 


x(t, w3) į 


™ 


Figure 5.13 Sample functions of 
a random process. 


random process. Then X(1) is a random variable taking values e~!, 2e~!, ..., 6e~! and each 


has probability t. Sample functions of this random process are shown in Figure 5.14. a 


Example 5.2.6 


We can have discrete-time random processes, which are similar to continuous-time random 
processes. For instance, let w; denote the outcome of a random experiment consisting of inde- 
pendent drawings from a Gaussian random variable distributed according to V(0, 1). Let the 
discrete-time random process {Xp}°29 be defined by Xo = 0 and Xn = Xn-1 + æn for all 
n > 1. This is an example of a discrete-time random process, which is nothing but a sequence 
of random variables. I 


5.2.1 Statistical Averages 


At any given time, the random process defines a random variable; at any given set of times 
it defines a random vector. This fact enables us to define various statistical averages for the 
process via statistical averages of the corresponding random variables. For instance, we 
know that, at any time instance fp, the random process at that time, i.e., X (tọ), is an ordinar) 
random variable; it has a density function and we can find its mean and its variance at tha 
point. Obviously, both mean and variance are ordinary deterministic numbers, but the: 
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x(t; w) į 


—4 =3 —2 -1 0 1 2 3 4 ¢ 
x(t; %2) 
2 
| | | | l 
—4 -3 -2 -1 0 1 2 3 4 t 
x(t; %3) 
3 
Figure 5.14 Sample functions 
-4 -3 -2 -i 0 i 2 3 4 t — of Example 5.2.5. 


depend on the time to. That is, at time tı, the density function, and thus the mean and the 
variance, of X (t1) will generally be different from those of X (to). 


Definition 5.2.1. The mean, or expectation, of the random process X (t) is a deter- 
ministic function of time denoted by mx (t) that at each time instant tọ equals the mean of 
the random variable X (fo). That is, my(t) = E[X (t)] for all t. | 


Since at any to the random variable X (tọ) is well defined with a probability density function 


fx (1) (x), we have 
foe) 


XG) =x) = / Jaak (5.2.1) 


Figure 5.15 gives a pictorial description of this definition. 


Example 5.2.7 
The mean of the random process in Example 5.2.2 is obtained by noting that 
+ 0<0<2n 
= 2n = 
fo®) | 0 otherwise 


Hence, 


2T 
E[X(t)] = Í Acos(2n fot + 6) a0 =0. 
0 27 


Weobserve that, in this case, my (t) is independent of t. ĀE 
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Figure 5.15 The mean of a random process. 


Another statistical average that plays a very important role in our study of random pro- 
cesses is the autocorrelation function. The autocorrelation function is especially important 
because it completely describes the power spectral density and the power content of a large 
class of random processes. 


Definition 5.2.2. The autocorrelation function of the random process X (t), denoted 
by Rx(h, t2), is defined by Ry (f1, t2) = E[X(t1) X ()]. a 


From this definition, it is clear that Ry (tı, t2) is a deterministic function of two variables t, 
and f given by 


ee 
a“ 


œo poo rs i 
Ry (t, t2) = f f X1X2 fX), X) (x1, x2) dx, dx. (5.2.2) 
—oo J—00 


Example 5.2.8 
The autocorrelation function of the random process in Example 5.2.2 is 


Rx(h, t2) = E[Acos(2n fot; + ©)A cos(2z fota + @)] 


1 1 
= AE E cos2n folti — t2) + 5 cos(2mfo(t) + t2) + 20)| 


2 
= cos 2n folti — t2), 


where we have used 


2n 
E[cos(2n fo(ti + t2) +20)] = Í cos[21 folti + t2) + 201=—d9 =0. a 
0 
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Example 5.2.9 
For the random process given in Example 5.2.3, we have 
+1 52 1 
Rx (t), b) = E(X?) zd —dx = =. a 
ap 2 3 


5.2.2 Wide-Sense Stationary Processes 


As we have seen, a random process observed at any given time is just a random variable, 
and the properties of this random variable depend on the time at which the random process 
is observed. It can happen that some of the properties of this random variable are indepen- 
dent of time. Depending on what properties are independent of time, different notions of 
stationarity can be defined. One of the most useful notions of stationarity is the notion of 
wide-sense stationary (WSS) random processes. A process is WSS if its mean and auto- 
correlation do not depend on the choice of the time origin. A formal definition is given in 
Definition 5.2.3. 


Definition 5.2.3. A process X (t) is WSS if the following conditions are satisfied: 
1. mx (t) = E[X (t)] is independent of t. 


2. Rx(ti, t2) depends only on the time difference t = tı — t2 and not on ¢; and h 
individually. a 


Hereafter, we will use the term stationary as a shorthand for WSS processes, and their 
mean and autocorrelation will be denoted by mx and Rx(t). 


Example 5.2.10 
For the random process in Example 5.2.2, we have already seen that mx=0 and 
Rx (ti, t2) = pa cos 27 fo(tı — t2). Therefore, the process is WSS. i] 
Example 5.2.11 


Let the random process Y (t) be similar to the random process X (t) defined in Example 5.2.2, 
but assume that © is uniformly distributed between 0 and v. In this case, 


my(t) = E[Acos(2nfot + ©)] 


=A f 2 cos(21 fot + 0) dO 
o = 
= å [sin(27 fot + 4)]5 
= A (—2 sin(2x fot)) 
x 


2A . 
= —— sin(2 fot). 
T 


Since my (t) is not independent of t, the process Y(t) is not stationary. E 


From the definition of the autocorrelation function, it follows that Ry (ti, t2) = 
Rx(t2, tı). This means that if the process is stationary, we have Rx(t) = Rx(—T), i.e., 
the autocorrelation function is an even function in stationary processes. 
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A class of processes closely related to stationary processes is the class of cyclosta- 
tionary processes. In cyclostationary processes, the statistical properties are periodic with 
time. We will give the formal definition of the cyclostationary processes here but their 
importance in communication system analysis will become more apparent in Chapter 10. 


Definition 5.2.4. A random process X (t) with mean mx(t) and autocorrelation 
function Rx(t + T, t) is called cyclostationary if both the mean and the autocorrelation are 
periodic in ¢ with some period Tọ, i.e., if 


mx(t + To) = mx(t) (5.2.3) 
and 
Ry(@t +t +70,t +7) = Rx(t +7, t) (5.2.4) 
for all ¢, and T. a 
Example 5.2.12 
Let X(t) = Acos(2n fot) where A is a random variable uniformly distributed on [0, 1]. Then 
mx(t) = 0.5 cos(21 fot) and 
1 1 
Rx(t +7,t)= z cos(2n fot) + z cos(2n fo(2t + T)) 
It is seen that both mx (t) and Rx(t + T, t) are periodic with period Tọ = mi Therefore the 
process is cyclostationary. - E 
Example 5.2.13 


Let Y(t) = X (t) cos(2x fot) where X(t) is a stationary random process with mean m anc 
autocorrelation Rx (t). Then 


? 
my (t) = E[X (t) cos(2xfot)] = mx cos(2x fot) (5.2.5 
and 


Ry(t +1,t) = E(X(t + 7) cos(2nfo(t + t)) X(t) cos(2x fot)) 


= Rx(t) É cos(2n fot) + ; cos(4n fot + 2n fo)] (5.2.6. 


It is seen that both my(t) and Ry(t + Tt, t) are periodic with period To = 5 Therefore, the 
process is cyclostationary. I 


For a cyclostationary process, X (t), we define the average autocorrelation function 
R x(t), as the average of Rx (t + T, t) over one period, i.e., 


S 1 f% 
Rx(t) = zS Rx(t + t, t)dt (5.2.7 
0 
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Example 5.2.14 
Determine the average autocorrelation function of the random process in Example 5.2.13 


Solution From Equation (5.2.7), we have 
S 1 f[™ 1 1 
Rxy(t) = T f Ry(t) 5 cos(27 fot) + 3 cos(4n fot + 27 fot) | dt 
0 Jo 


2 


= a”) cos(27 fot) (5.2.8) 


To 
= 1kg) cos(27 fot) + aR o f cos(4x fot + 27 fot) dt 
0 0 


ane r 


5.2.3 Multiple Random Processes 


Multiple random processes arise naturally when dealing with two or more random pro- 


cesses. For example, take the case where we are dealing with a random process X (t) and 
we pass it through a linear time-invariant (LTI) system. For each sample function input 
x(t; wi), we have a sample function output y(t; wi) = x(t; wi) x» h(t), where h(t) denotes 
the impulse response of the system. We can see that for each œw; € Q, we have the two 
signals x(t; @;) and y(t; w;). Therefore, we are dealing with two random processes, X (t) 
and Y (t). When dealing with two random processes, we naturally question the dependence 
between the random processes under consideration. To this end, we define the indepen- 
dence of two random processes. 


Definition 5.2.5. Two random processes X (t) and Y(t) are independent if for all 


positive integers m,n, and for all tı, t2,..., fn and T1, T2,..., Tm, the random vectors 
(X(t), X (t2), ..., X (ta)) and (Y (t1), Y (T2), ..., Y (Tm)) are independent. Similarly, X (t) 
and Y (t) are uncorrelated if the two random vectors are uncorrelated. E 


From the properties of random variables, we know that the independence of random pro- 
cesses implies that they are uncorrelated, whereas uncorrelatedness generally does not 
imply independence, except for the important class of Gaussian processes (to be defined 


in Section 5.3) for which the two properties are equivalent. Next, we define the correlation 


function between two random processes. 


Definition 5.2.6. The cross correlation between two random processes X (t) and 
Y(t) is defined as 
Ryy (ti, 2) = E[X()Y()]. (5.2.9) 


a 
From the preceding definition, in general, 
Rxy (i, t2) = Ryx(h, tı). (5.2.10) 


The concept of stationarity can also be generalized to joint stationarity for the case 
of two random processes. 
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Definition 5.2.7. Two random processes X (t) and Y (t) are jointly wide-sense sta- 
tionary, or simply jointly stationary, if both X (t) and Y (t) are individually stationary and 
the cross-correlation Ryy (t1, t2) depends only on T = t; — ho. E 


Note that for jointly stationary random processes, from the definition and Equation (5.2.10), 
it follows that 


Rxy(t) = Ryx(—T). (5.2.11) 


Example 5.2.15 
Assuming that the two random processes X (t) and Y(t) are jointly stationary, determine the 
autocorrelation of the process Z(t) = X(t) + Y(t). 


Solution By definition, 


Rz(t +1, t) = E[Z¢@+7)Z@]) 
=E(X(t+1)+¥C4+7))(XH+YO)) 
= Rx(t) + Ry(t) + Rxy(t) + Rxy(—T). a 


5.2.4 Random Processes and Linear Systems 


In the section on multiple random processes, we saw that when a random process passes 
through an LTI system, the output is also a random process defined on the original prob- 
ability space. In this section, we will study the properties of the output process based on 
the knowledge of the input process. We are assuming that a stationary process X (t) is the 
input to an LTI system with the impulse response h(t) and the output process is denoted 
by Y(t), as shown in Figure 5.16. 

Weare interested in the following questions: Under what conditions will the output 
process be stationary? Under what conditions will the input and output processes be jointly 
stationary? How can we obtain the mean and the autocorrelation of the output process, as 
well as the cross correlation between the input and output processes? 

We next demonstrate that if a stationary process X (t) with mean my and autocor- 
relation function Rx(t) is passed through an LTI system with impulse response h(t), the 
input and output processes X (t) and Y (t) will be jointly stationary with 


my = myx i h(t) dt; (5.2.12) 
Rxy(t) = Rx(t) * h(—-7T); (5.2.13) 
Ry(t) = Rx(t) x h(t) x h(—T). (5.2.14) 


Figure 5.16 A random process passing through an LTI system. 
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By using the convolution integral to relate the output Y (t) to the input X (t), i.e., Y(t) = 
foe, X(t) h(t — t) dt, we have 


E{(Y@)] = | | xoa- nar] 


is E[X (t)]h(t — t) dt 


[mene —T)dt 


o0 


u=t—T 9 
= mx f h(u)du = my. 
oo 


. This proves that my is independent of t. 


The cross-correlation function between the output and the input is 
E[X (4) ¥(o)] E[X(H) ¥ (m)] 


E [xa iM X (s)h(ta — s) as| 


a E[X (t1)X (s)]h(t2 — s) ds 


m Rx(ti — s)h(t — s) ds 


u=s—f; sa 
= f Ryti — b — uh(—u)du 


—~00 


/ Rx(t —u)h(—u) du 


Rx(t) *h(—t) = Rxy(t). 





This shows that Ryy(t), t2) depends only on T = t, — fp. 
The autocorrelation function of the output is 


E(Y(t)¥()] ELY Y (2)] 


E (a X(s)h(t; — s) as) re] 


= | Rxy(s — b)h(ti — s) ds 


vase / Ryy(wh(ty — h — u) du 
— 0 


= Ryy(t)»*xh(t) 
Rx(t) * h(—T) x h(t) = Ry(t). 
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In the last step, we have used the result of the preceding step. This shows that Ry and Rxy 
depend only on t = tı — fz and, hence, the output process is stationary. Therefore, the input 
and output processes are jointly stationary. 


Example 5.2.16 f 
Assume a stationary process passes through a differentiator. What are the mean and autocor- 
relation functions of the output? What is the cross correlation between the input and output? 


Solution In a differentiator, h(t) = 5’(t). Since 5’(t) is odd, it follows that 


co 
my = my f 5'(t) dt 


00 


=0. 


The cross-correlation function between output and input is 
id t d 
Rxy(t) = Rx (1) *8'(—t) = —Rx(t) * ô' (1) = -g RO 


and the autocorrelation function of the output is 


d a 
Ry(t) = a, Rx®) x(t) = -ga xO 


where we have used Equation (2.1.34). E 


Example 5.2.17 
Repeat the previous example for the case where the LTI system is a quadrature filter defined 
by A(t) = +; therefore, H (f) = — j sgn( f ). In this case, the output of the filter is the Hilbert 


net 
transform of the input (see Section 2.6). 


Solution We have 


o] 
my = mx f —dt =0 
oo Tit 


1 


because -; is an odd function. The cross correlation function is 





1 å 
Rxy(T) = Rx(T) * = —Rx(T), 
TNT 
and the autocorrelation function of the output is 
1 a 
Ry (t) = Rxy(t) * = —Rx(t) = Rx(*), 


where we have used the fact that R(t) = —x(t) and assumed that Rx(t) has no DC 
component. r 


5.2.5 Power Spectral Density of Stationary Processes 
A random process is a collection of signals, and the spectral characteristics of these signals 


determine the spectral characteristics of the random process. If the signals of the random 
process are slowly varying, then the random process will mainly contain low frequencies 
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and its power will be mostly concentrated at low frequencies. On the other hand, if the 
signals change very fast, then most of the power in the random process will be at the 
high-frequency components. 

A useful function that determines the distribution of the power of the random process 
at different frequencies is the power spectral density or power spectrum of the random 
process. The power spectral density of a random process X (t) is denoted by Sy(f), and 
denotes the strength of the power in the random process as a function of frequency. The 
unit for power spectral density is W/Hz. 

For stationary random processes, we use a very useful theorem that relates the power 
spectrum of the random process to its autocorrelation function. This theorem is known as 
the Wiener—Khinchin theorem. 


Theorem [Wiener—-Khinchin] For a stationary random process X (t), the power spectral 
density is the Fourier transform of the autocorrelation function, i.e., , 


For a cyclostationary process, the power spectral density is the Fourier transform of the 
average autocorrelation function, i.e., 


Sx(f) = F[Rx(t)] (5.2.16) 


Example 5.2.18 
For the stationary random process in Example 5.2.2, we had 


A? 
Rx(t) = SH cos(2n fot). 


Hence, 
A? 
Sx(f) = zeU — fo) + 6(f + fod]. 


The power spectral density is shown in Figure 5.17. All the power content of the process 
is located at fọ and — fo. This is expected because the sample functions of this process are 


sinusoidals with their power at those frequencies. a 
Sx(f) 
A? A? 
4 4 
Figure 5.17 Power spectral 





> density of the random process of 
fo fo f Example 5.2.18. 
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Example 5.2.19 
The process of Example 5.2.3 is stationary. In this case, 


Rx(t) = E[X(t + t)X(t)] = E[X?] = 


wl 


Hence, 


] 1 
Sx(f) = a5] = zo) 


Obviously, in this case, for each realization of the process, we have a different power 
spectrum. C 


Example 5.2.20 
Determine the power spectrum of the random process given in Example 5.2.13 


Solution From Example 5.2.14, we know that the process is cyclostationary with average 
autocorrelation function ‘ 


Rx(t) = aan cos(2n fot). 





Taking the Fourier transform, we obtain 


Rx(t) 





1 1 
Sx(f) = a| cos(2n far) | = goxf — fo) + goxF + fo). (5.2.17) 


This shows that the power spectrum of a modulated WSS random process is the power spec- 
trum of the original signal shifted by + fo and scaled by 1/4. a 


The power content, or simply the power, of a random process is the sum of the 
powers at all frequencies in that random process. In order to find the total power, we have 
to integrate the power spectral density over all frequencies. This means that the power in 
the random process X (t), denoted by Py, is obtained using the relation 


Py = / Sx(f) af. (5.2.18) 


Since Sx(f) is the Fourier transform of Ry(t), then Rx(t) will be the inverse 
Fourier transform of Sy(f). Therefore, we can write 


Ry(t) = J a Sy(f)ei2™F dt. (5.2.19) 


Substituting t = 0 into this relation yields 


o0 
Rx (0) =i Sx(f)df. (5.2.20) 
— 00 
Comparing this with Equation (5.2.18), we conclude that 


Py = Rx(0). (5.2.21) 
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From Equations (5.2.18) and (5.2.21), we see that the power in a stationary random process 
can be found either by integrating its power spectral density (adding all power components) 
or substituting t = 0 in the autocorrelation function of the process. 


Example 5.2.21 : 
Find the power in the process given in Example 5.2.18 


Solution We can use either the relation 


P; = / Sx(f) df 


fos] 2 
=f EZ — fo) +f + fo af 


= 
A? 

=2 x — 
x% 


A? 


2 


or the relation 


Px = Rx (0) 
2 
= As cos(2x fot) 
2 
t=0 


A? 
= —. a 
2 


Power Spectra in LTI Systems. We have seen that when a stationary random 
process with mean m, and autocorrelation function Rx(t) passes through an LTI system 
with the impulse response h(t), the output process will be also stationary with mean 


o0 
my = mx | h(t) dt 
o0 


and autocorrelation 
Ry(t) = Ry(T)xh(t)x»xh(—T). 


We have also seen that X(t) and Y(t) will be jointly stationary with the cross-correlation 
function 


Rxy(t) = Rx(t) x h(—T). 


Translation of these relations into the frequency domain is straightforward. By noting that 
Fl(h(—t)] = H*(f) and S r h(t) dt = H (0), we can compute the Fourier transform of 
both sides of these relations to obtain 


my =myH(0) (5.2.22) 
Sy(f) =Sx(f)IA(f)P. (5.2.23) 
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The first equation says that since the mean of a random process is basically its DC value, 
i.e., the mean value of the response of the system only depends on the value of H(f) 
at f = 0 (DC response). The second equation says that when dealing with the power 
spectrum, the phase of H(/) is irrelevant; only the magnitude of H (f) affects the output 
power spectrum. This is also intuitive since power depends on the amplitude and not on 
the phase of the signal. For instance, if a random process passes through a differentiator, 
we have H(f) = j2nf; hence, 


my =mxH(0)=0 (5.2.24) 
Sy(f) = 40’ f?Sx(f).. (5.2.25) 


We can also define a frequency-domain relation for the cross-correlation function. 
Let us define the cross-spectral density S xy (f) as 


Sxv(f) = FRx). (5.2.26) 
Then l 
Sxy(f) = Sx (f) H"(f), (5.2.27) 

and since Ryx(t) = Rxy(—t), we have : 
Syx(f) = Sky (f) = Sx(f) H(f). (5.2.28) 


Note that although Sx(f) and Sy(f) are real nonnegative functions, Sxy (f) and Syx(f) 
can generally be complex functions. Figure 5.18 shows how these quantities are related. 


Example 5.2.22 
If the process in Example 5.2.2 passes through a differentiator, we have H(f) = j2xf: 
therefore, 


A? i 
Sy(f) = 4x? f? [ror — fo) +8(f + fa) | = A’n? fC — fo) + 8(F + fo)] 


Sxy(f) ` 


Sx(f) Syx(f) 





\H( pÊ Sy(f) Figure 5.18 Input--output relations for the power 
spectral density and the cross-spectral density. 
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and 


j A? 
SrO = GANSKA) = ETE UE + fo -8 - l. 2 


Example 5.2.23 ` 
Passing the process in Example 5.2.3 through a differentiator results in 


Sy(f) = 41’ f? (o) =0, 


1 
Sxr(f) = (-j2af) (547) =0, 


where we have used the basic properties of the impulse signal in both relations. These results 
are intuitive because the sample functions of this process are constant and differentiating them 
results in zero output. a 


5.2.6 Power Spectral Density of a Sum Process 


In practice, we often encounter the sum of two random processes. For example, in the case 
of communication over a channel with additive noise, the noise process is added to the 
signal process. Next, we determine the power spectral density for the sum of two jointly 
stationary processes. 

Let us assume that Z(t) = X(t) + Y(t), where X (t) and Y (t) are jointly stationary 
random processes. We already know from Example 5.2.15 that Z (t) is a stationary process 
with 

Rz(t) = Rx(t) + Ry(t) + Rxy(t) + Ryx(t). (5.2.29) 


Taking the Fourier transform of both sides of this equation and using the result of Prob- 
lem 5.39, we obtain 


Sz(f) =Sx(f) + Sy(f) + Sxv(f) + Syx(f) 
S$ 

Sty (f) 

= Sy(f) + Sy(f) + 2Re[Syy(f)]. (5.2.30) . 


The preceding relation shows that the power spectral density of the sum process is the sum 
of the power spectra of the individual processes plus a third term, which depends on the 
cross correlation between the two processes. 

If the two processes X (t) and Y (t) are uncorrelated, then 


Rxy(t) = mx my. 
Now if at least one of the processes is zero mean, we will have Rxy(t) = 0 and 


S2(f) = Sx(f) + Sy(f). (5.2.31) 
2 
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Example 5.2.24 

Let X (t) represent the process in Example 5.2.2, and let Z(t) = X(t) + 4x (t). Then 

Sxr(f) = a L (8(F + fo) — ôC — o)l; 
therefore, 
Re[Sxr(f)]=0 
Hence, 
1 : 
S2(f) = Sx(f) + Sy(f) = A’ G + x? ft) [ECF — fo) +6(f + fo)l. a 


5.3 GAUSSIAN AND WHITE PROCESSES 


























Gaussian processes play an important role in communication systems. The fundamental 
reason for their importance is that thermal noise in electronic devices, which is produced 
by the random movement of electrons due to thermal agitation, can be closely modeled by 
a Gaussian process. 

In order to seethe reason why thermal noise is a Gaussian process, consider a resistor. 
The free electrons in a resistor move as a result of thermal agitation. The movement of the 
electrons is quite random and can be in any direction; however, their velocity is a function 
of the ambient temperature. The higher the temperature, the higher the velocity of the 
electrons. The movement of these electrons generates a current with a random value. We 
can consider each electron in motion as a tiny current source, whose current is a random 
variable that can be positive or negative, depending on the direction of the movement of 
the electron. The total current generated by all electrons, which is the generated thermal 
noise, is the sum of the currents of all these current sources. We can assume that at least 
a Majority of these sources behave independently and, therefore, the total current is the 
sum of a large number of independent and identically distributed random variables. Now 
by applying the central limit theorem, we conclude that this total current has a Gaussian 
distribution. This is the reason that thermal noise can be very well modeled by a Gaussian 
random process. $ 

Gaussian processes provide rather good models for some information sources as 
well. Some interesting properties of the Gaussian processes, which will be discussed in 
this section, make these processes mathematically tractable and easy to use. 


vi 


5.3.1 Gaussian Processes 


In a Gaussian random process, we can look at different instances of time and the resulting 
random variables will be jointly Gaussian. We start our discussion with a formal definition 
of Gaussian processes. 


Definition 5.3.1. A random process X(t) is a Gaussian process if for all n anc 
all (t1,f,...,t), the random variables {X(¢;)}?_, have a jointly Gaussian density 
function. i 
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From the preceding definition, we know that at any time instant fo, the random variable 
X (to) is Gaussian; at any two points tı, #2, random variables (X (t1), X (t2)) are distributed 
according to a two-dimensional jointly Gaussian distribution. 


Example 5.3.1 


Let X (t) be a zero-mean stationary Gaussian random process with the power spectral density 
S(t) =5 (do). Determine the probability density function of the random variable X (3). 


Solution Since X(t) is a Gaussian random process, the probability density function of ran- 

` dom variable X (t) at any value of t is Gaussian. Therefore, X (3) ~ N (m, o?). Now we need 
to find m and o?. Since the process is zero mean, at any time instance t, we have E[X(t)] = 0; 
this means m = E[X(3)] = 0. To find the variance, we note that 


o° = VAR[X(3)] 
= E[X*(3)] — (EIX 6)? 
= E[X(3)X(3)] 
= Rx(0), 


where, in the last step, we have used the fact that the process is stationary and hence 
E[X(t) X(®)] = Rx (t, — t). But from Equation (5.2.20), we have 


o? = Rx(0) 


oo 
= f Sx(f) af 
: -00 
500 
= i Saf 
—500 
= 5000; 
therefore, X (3) ~ M (0, 5000), or the density function for X (3) is 2 
° 1 
-/ 10,0007 


Just as we have defined jointly Gaussian random variables, we can also define jointly 
Gaussian random processes. 


fx) = eim, = 


Definition 5.3.2. The random processes X (t) and Y (t) are jointly Gaussian if for 


all n, m and all (ti, t2,...,tn) and (T1, T2,...,Tm), the random vector (X (ti), 
X(t2),..-,X (ta), Y (t1), Y (T2), ..., ¥(tm)) is distributed according to an n + m dimen- 
sional jointly Gaussian distribution. a 


From this‘definition, it is obvious that if X (t) and Y (t) are jointly Gaussian, then each of 
them is individually Gaussian; but the converse is not always true. That is, two individually 
Gaussian random processes are not always jointly Gaussian. 

Gaussian and jointly Gaussian random processes have some important properties that 
are not shared by other families of random processes. Two of the most important properties 
are given here: 


il 


228 Probability and Random Processes Chapter 5 


Property 1. If the Gaussian process X (t) is passed through an LTI system, then 
the output process Y (t) will also be a Gaussian process. Moreover, X (t) and Y ce will be 
‘jointly Gaussian processes. 

: This property follows directly from the fact that linear combinations of jointly Gaus- 
sian random variables are themselves jointly Gaussian. 

| This result is very important; it demonstrates one of the properties that makes Gaus- 
sian processes attractive. For a non-Gaussian procesś, knowledge ofthe statistical prop- 
erties of the input process does not easily lead to the statistical properties of the output 
process. However, for a Gaussian process, we know that the output process of an LTI 
system will also be Gaussian. 


Property 2. For jointly Gaussian processes, uncorrelatedness and independence 
are equivalent. 
This is also a straightforward consequence of the basic properties of Gaussian ran- 
dom variables, as outlined in our discussion of jointly Gaussian random variables. - 





Example 5.3.2 
The random process X(t) in Example 5.3.1 is passed through a differentiator and the out- 
put process is denoted by Y(t), i.e., Y Œ) = ix (t), determine the probability density func- 
tion Y (3). 


Solution Since a differentiator is an LTI system, by Property 1, Y(r) is a Gaussian process. 
This means that Y(3) is a Gaussian random variable with mean my and variance ož. Also 
note that the impulse response of a differentiator is h(t) = 5’(t); hence, its transfer function is 
H(f) = F[h(t)] = j2x f. In order to find my, we use the result of Example 5.2.16, which 
shows my = 0. To find cĉ, we use a method similar to the one employed in Example 5.3.1. 
First we have to find Sy (f), which can be found using Equation (5.2.23) with H (f) = j2mf. 
This is very similar to Examples 5.2.22 and 5.2.23. We have 








ig f SP) df 


500 g 
z 5x 4n’ f? df 





z~ 1.64 x 10!. 


Since my = 0 and o? = 1.64 x 10!°, we have Y (3) ~ N (0, 1.64 x 10!°). I 


5.3.2 White Processes 


The term white process is used to denote processes in which all frequency component 
appear with equal power, i.e., the power spectral density is a constant for all frequencies 
This parallels the notion of “white light,” in which all colors exist. 
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Definition 5.3.3. A process X(t) is called a white process if it has a flat power 
spectral density, i.e., if Sx( f) is a constant for all f. C] 


In practice, the importance of white processes stems from the fact that thermal noise can be 
closely modeled as a white process over a wide range of frequencies. Also, a wide range of 
processes used to describe a variety of information sources can be modeled as the output of 
LTI systems driven by a white process. Figure 5.19 shows the power spectrum of a white 
process. f ; 

If we find the ‘power content of a white process using Sy(f) = C, a constant, we 
will have 








Py = [7 suanar = f" caf =o. 


Obviously, no real physical process can have infinite power; therefore, a white process is 
not a meaningful physical process. However, quantum mechanical analysis of the thermal 
noise shows that it has a power spectral density given by 


A 
ee (5.3.1) 
2(eit - 1) 
in which # denotes Planck’s constant (equal to 6.6 x 10~** J x sec) and k is Boltzmann’s 
constant (equal to 1.38 x 107% J/K). T denotes the temperature in degrees Kelvin. This 
, power spectrum is shown in Figure 5.20. 

The preceding spectrum achieves its maximum at f = 0, and the value of this max- 
imum is a The spectrum goes to zero as f goes to infinity, but the rate of convergence 
to zero is very slow. For instance, at room temperature (T = 300° K) S,(f) drops to 90% 
of its maximum at about f ~% 2 x 10! Hz, which is beyond the frequencies employed in 
conventional communication systems. Thus, we conclude that thermal noise, though not 
precisely white, for all practical purposes can be modeled as a white process with a power 


Sa (f) = 





Sx(f) } 
No 
2 
>  Figure5.19 Power spectrum of a white 
f process. 
Sn) i} 
kT 
2 





= Figure 5.20 Power spectrum of thermal 
f noise. 
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spectrum equal to 2. The value kT is usually denoted mes No; therefore, the power spectral 
density of termala noise is usually given as S,(f) = Mo . It is sometimes referred to.as the 
two-sided power spectral density, emphasizing that this spectrum extends to both positive 
and negative frequencies. We will avoid this terminology throughout this text and simply 
use power spectrum or power spectral density. 

Looking at the autocorrelation function for a white process, we see that E 


R,(t) = F! [>] = Msa). (5.3.2) 
2 2 ‘ 
This shows that for all t # 0, we have Rx(t) = 0. Thus, if we sample a white process 
at two points ¢; and fz (fi Æ t2), the resulting random variables will be uncorrelated. If 
the random process is white and also Gaussian, any pair of random variables X (t1), X (t2), 
where t Æ t2, will also be independent (recall that for jointly Gaussian random variables, 
uncorrelatedness and independence are equivalent.) : 


Properties of the Thermal Noise. The thermal noise that we will use in subse- 
quent chapters is assumed to have the following properties: 


_ 1. Thermal noise is a stationary process. 
2. Thermal noise is a zero-mean process. 
3. Thermal noise is a Gaussian process. 


4. Thermal noise is a white process with a power spectral density S,,(f) = a, i 
Na 
It is clear that the power spectral density of thermal noise increases with increasing the 
ambient temperature; therefore, keeping electric circuits cool makes their noise level low. 


5.3.3 Filtered Noise Processes 


In many cases, the white noise generated in one stage of the system is filtered by the next 
stage; therefore, in the following stage, we encounter filtered noise that is a bandpass pro- 
cess, i.e., its power spectral density is located away from the zero frequency and is mainly 
located around some frequency fs, which is far from zero and larger than the bandwidth of 
the process. i 

In a bandpass process, sample functions are bandpass signals; like the bandpass sig- 
nals discussed in Section 2.7, they can be expressed in terms of the in-phase and quadrature 
components. In this section, we study the main properties of bandpass noise processes and 
particularly study the main properties of the in-phase and quadrature processes. 

Let us assume that the process X (t) is the output of an ideal bandpass filter of band- 
width W which is located at frequencies around f,. For example, one such filter can have 
a transfer function of the form 


H (f) = If- fel SW (5.3.3) 


0 otherwise 
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f.-W fh ; L LW f 


Figure 5.21 Filter transfer functions H, (f) and H2(f). 


Another example of a such filter would be 


1 fe<slIfl< fe+W 


(5.3.4) 
0 otherwise 


Hf) = 
Figure 5.21 shows plots of the transfer functions of these two filters. 
Since thermal noise is white and Gaussian, the filtered thermal noise will be Gaussian 
but not white. The power spectral density of the filtered noise will be 


N N 
Sx(f) = Slr = SA), 
20° 2 
where we have used the fact that for ideal filters |H (f)I? = H(f). For the two filtered 


noise processes corresponding to Hı (f) and H2(f), we have the following power spectral 
densities: 


No 
= fp? lf s fel < W 
Sa) = ( otherwise 39) 
and i 
w fe<\fl<fet+W 
—]?2 = = 
SK) = ; otherwise f O20) 


All bandpass filtered noise signals have an in-phase and quadrature component that are 
lowpass signals. This means that the bandpass random process X (t) can be expressed in 
an equation that is similar to Equation (2.7.8), i.e., 


X(t) = X(t) cos(2n fet) — X s(t) sin(2n fct), (5.3.7) 
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where X(t) and X ,(t)—the in-phase and quadrature components—are lowpass processes. 
Note that, as we did with Equation (2.7.17), we can also represent the filtered noise in terms 
of its envelope and phase as 


X(t) = A(t) cos(2n fet + 0(t)), (5.3.8) 


where A(t) and @(t) are lowpass random processes. 


Properties of the In-Phase and Quadrature Processes. For filtered’ white 
Gaussian noise, the following properties for X,(t) and X,(t) can be proved (for Pros, 
see Proakis and Salehi (2008)): 


1. X(t) and X(t) are zero-mean, lowpass, jointly stationary, and jointly Gaussian rap- 
dom processes. Oo =Total per, ( fo, eromalo* 2N,W sr filter 

2. If the power in process X (t) is Px, then the power in each of the processes X(t) and 
X s(t) is also Px. In other words, 


P= Pr = Px, = f SKAF, (539) 


3. Processes X,(t) and X(t) have a common power spectral density. This power. spec- 
tral density is obtained by shifting the positive frequencies in Sy(f) to the left by 
fc, shifting the negative frequencies of Sx (f) to the right by fz, and adding the two 
shifted spectra. Therefore, if H)(f), given in Equation (5.3.3), is used, then 


N <W 
OERE h oe 4 (53.10) 
otherwise 
and if H2 (f), given in Equation (5.3.4) is used, then . 
No 
ed <W 
sea Er, (5.3.11) 


0 otherwise 


The power spectral densities for these two cases are shown in Figure 5.272. It is obvi- 
ous that in both cases, the power in the in-phase and quadrature components is equal 
to the power in the bandpass noise. When H;(f) is used, the power in the filtered 
noise from Equation (5.3.5) is P) = a x 4W = 2NoW, and when H2(f) is used, 
the power in the filtered noise from Equation (5.3.6) is P) = Mo x 2W = NoW. By 
integrating the power spectral density, we can see that the power in the in-phase and 
quadrature components in these two cases are 2NoW and NoW, respectively, which 
are equal to the corresponding P, and P2. 

4. If +f, and —f, are the axis of symmetry of the positive and negative frequencies 
respectively, in the spectrum of H (f)—as is the case for Hı (f) in Equation (5.3.3) 
and is not the case for Hj(f) in Equation (5.3.4)then X,(t) and X,(t) will be 
independent processes. 
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a Sx Ff ) = Sx (f ) 


No 





a 


-W w f 


j Sx f) = Sx) 





Figure 5.22 Power spectral densities of the in-phase and quadrature component's of the filtered noise passed 
through H (f) and H,(f). 


Example 5.3.3 


For the bandpass white noise at the output of filter Hı P), given in Equation (5.3.3), find the 
power spectral density of the process Z(t) = aX .¿(t) + bX s(t). 


Solution Since f, is the axis of symmetry of the noise power spectral density, the in-phase 
and quadrature components of the noise will be independent; therefore, we are dealing with 
the sum of two independent and zero-mean processes. This means that the conditions leading 
to Equation (5.2.31) are satisfied, and the power spectral density of Z(t)'is the sum of the 
power spectral densities of aX .(t) and bX, (t). But we already know that Sy,(f) = Sx, (f) = 
` Sy(f), therefore S7(f) = a?Sx(f) + BS (f) = (a? + b*)Sx (fY. Note that in the special 
case where a = cos 0 and b = — sin @, we have Sz(f) = Sx(f). E 








Noise Equivalent Bandwidth. When a white Gaussian noise passes through a 
filter, the output process, although still Gaussian, will not be white anymore. The filter 
characteristic shapes the spectral properties of the output process, and we have 


Ni 
Sy(f) =Sx(f)IA(f)P = z EOP. 


Now if we want to find the power content of the output process, we have to integrate Sy (f). 
Thus, 


Co N CO 
Py = | Sy(f)df = =f IH(f)Ê df. 
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Therefore, to determine the output power, we have to evaluate the integral J ier |H(f) 2d f: 
To do this calculation, we define Bneq, the noise equivalent bandwidth of a filter with the 
frequency response H (f), as 


fre IAA)? df 
neg = (5.3.12) 
2 H2 x 
where Hmax denotes the maximum of |H(f)| in the passband of the filter. Figure 5.23 
shows Hmax and Bneg for a typical filter. l 
Using the preceding definition, we have 


N, foe) 
Pe =e [nn ear 


= No Breq Hoax: (5.3.13; 


Therefore, when we have Breq, finding the output noise power becomes a simple task. The 
noise equivalent bandwidth of filters and amplifiers is usually provided by. the 
manufacturer. 


Example 5.3.4 
Find the noise equivalent bandwidth of a lowpass RC filter. 


Solution For this filter, 


1 


HY Tain F RO’ 


and is shown in Figure 5.24. Defining t = RC, we have 


1 


H = —___—_, 
SOS Tae 


Figure 5.23 Noise equivalent 
> Bug <— f bandwidth of a typical filter. 
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1 
HOO ae 
1 





a pa aie ae 





Figure 5.24 Frequency response of a lowpass 
0 f RC filter. 


and therefore H max = 1. We also have 
er 
o 1+ 4n? f2r2 
w=2nft f a | du 


o l+u? X Inr 


f IHAPdf = df 


ye 
2 


Hence, 


5.4 SUMMARY AND FURTHER READING 


In this chapter, we presented a review of the basic definitions and concepts in probabil- 
ity and random processes. Several random variables and their properties were introduced, 
including the Bernouli random variable, the binomial random variable, the uniform random 
variable, and the Gaussian random variable. Functions of random variables were treated, 
including statistical averages such as the mean value and the variance. We also covered 
multiple random variables and joint statistical parameters of multipte random variables, 
such as joint moments, e.g., their covariance. 

In our coverage of random processes, we introduced wide-sense stationary pro- 
cesses and their autocorrelation function, and multiple random processes and their cross- 
correlation functions. We also introduced the power spectral density of stationary processes 
and the Wiener—-Khinchin theorem, which relates the power spectral density to the Fourier 
transform of the autocorrelation function. When a linear time-invariant (LTI) system is 
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excited by a stationary random process, the system output is also stationary. In such a case, 
we related the aurocorrelation and power spectral density of the LTI system output to the 
autocorrelation and power spectral density of the input random process and to time- and 
frequency-response characteristics of the LTI system. 

The final topic of this chapter focused on Gaussian random processes and spectrally 
white processes. In our treatment of white processes, we presented the properties of ther- 
mal noise and characterized thermal noise as a white Gaussian random process. We also 
considered filtering a white Gaussian noise process by a bandpass filter to obtain a band- 
pass Gaussian noise process. Then, we showed that such a bandpass filtered noise pro- 
cess is expressed in terms of an in-phase component and a quadrature component, both of 
which are lowpass random processes. Finally, the spectral properties of the in-phase and 
quadrature random processes were presented. 

The books by Leon Garcia (1994), Helstrom (1991), Davenport and Root (1987), 
Papoulis (1991), Nelson (1995), and Stark and Woods (1994) cover probability and ran- 
dom processes with an emphasis on electrical engineering applications. Gray and Davisson 
(1986) is particularly interesting, since it covers random processes for electrical engineers 
without compromising mathematical rigor. 


PROBLEMS 


5.1 A random experiment consists of drawing a ball from an um that contains four red 
balls (numbered 1, 2, 3, 4) and three black balls numbered (1, 2, 3). Precisely state 
what outcomes are contained in the following events: 


. E, = The number on the ball is even. 

E = The color of the ball is red and its number is greater than 1. 
. E3 = The number on the ballis less than 3. 

. E4 = E1 U E3. 

. Es = E; U (E2 N E3). 


vn Pp wn = 


5.2 If all balls in the preceding problem are equally likely to be drawn, find the proba- 
bilities of E;, 1 <i <5. 


5.3 Ina certain city, three car brands A, B, and C have 20%, 30%, and 50% of the market 
share, respectively. The probabilities that a car will need major repair during its first 
year of purchase are 5%, 10%, and 15%, for these brands, respectively. 


1. What is the probability that a car in this city will need major repair during its 
first year of purchase? 


2. If a car in this city needs major repair during its first year of purchase, what is 
the probability that it is made by manufacturer A? 


5.4 Under what conditions can two disjoint events A and B be independent? 
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5.5 An information source produces 0 and 1 with probabilities 0.3 and 0.7, respectively. 
The output of the source is transmitted via a channel that has a probability of error 
(turning a 1 into a0 or a 0 into a 1) equal to 0.2. 


1. What is the probability that a 1 will be observed at the output? 


2. What is the probability that a 1 was the ee of the source if a 1 is observed at 
the output of the channel? 


5.6 A coin is flipped three anes, and the random variable X denotes the total number of 
heads that show up. The probability of landing on a head in one flip of this coin is 
denoted by p. 


1. What values can the random variable X take? 

2. What is the probability mass function of the random variable X? 
3. Derive and plot the CDF of X. 

4. What is the probability that X exceeds 1? 


5.7 For coin A, the probability of landing on a head is equal to 7 and the probability of 
landing on a tail is equal to 3; coin B is a fair coin. Each coin is flipped four times. 
Let the random variable X denote the number of heads resulting from coin A, and Y 
denote the resulting number of heads from coin B. 


1. What is the probability that X = Y = 2? 
2. What is the probability that X = Y? 
3. What is the probability that X > Y? 
4. What is the probability that X + Y < 5? 


ee 


5.8 A random variable X is defined by the CDF 
0, x <0 
ROS ge, Q<x<1. 
K, x>1- 


. Find the value of K. 

. Is thisrandom variable discrete, continuous, or mixed? 
. What is the probability that 5 < X < 1? 

. What is the probability that 5 < X < 1? 

. What is the probability that X exceeds 2? 


anak WN = 


5.9 Random variable X is distributed according to fy(x) = A(x). 


1. Determine P(X > 5). 
2. Determine P(X > O|X < 5). 
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3. Whatis fx(x|X > 4)? 


4. 


What is E(X|X > 4)? 


5.10 X is a Gaussian random variable with a mean 4 and a variance 9, i.e., N(4, 9). 
Determine the following probabilities: 


1. 
2. 


P(X > 7). 
P(0 < X <9). 


5.11 The noise voltage in an electric circuit can be modeled as a Gaussian random variable 
with a mean equal to zero and a variance equal to 1078. 


1. 


What is the probability that the value of the noise exceeds 1074? What is the 
probability that it exceeds 4 x 10-4? What is the probability that the noise value 
is between —2 x 1074 and 1074? 


. Given that the value of the noise is positive, what is the probability that it exceeds 


1074? 


. This noise passes through a half-wave rectifier with characteristics 


vee x, x>0 
Be Nie age" 


Find the PDF of the rectified noise by first finding its CDF. Why can we not use 
the general expression in Equation (5.1.11)? 


4. Find the expected value of the rectified noise in the previous part. 


. Now assume that the noise passes through a full-wave rectifier defined by g(x) = 


|x|. Find the density function of the rectified noise in this case. What is the 
expected value of the output noise in this case? 


5.12 X is a N(0, o?) random variable. This random variable is passed through a systen 
whose input-output relation is given by y = g(x). Find the PDF or the PMF of th 
output random variable Y in each of the following cases: 


1. Square law device, g(x) = ax’. 


2 


2. Limiter, 


—b, x<-b 
g(x)={ b, x=b 
x, |x| <b 


3. Hard limiter, 


a, x>0O 
g(x) = {0, x=0. 
b, x <0 
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4. Quantizer, g(x) = Xn fora, < x < anti, 1 < n < N, where x, lies in 


the interval [an, a,41], the sequence {a), a2, . . . , an+1} satisfies the conditions 
a, = —0O, an; = 00, and fori > j, we havea; > aj. 
5.13 The random variable ® is uniformly distributed on the interval =F, zl Find the 


probability density function of X = tan ®. Find the mean and the variance of X. 





5.14 Let Y be a positive-valued random variable, i.e., fy (y) = O fory <0. 





1. Let œ be any positive constant, and show that P(Y > a) < EN (Markov 
inequality). , 

2. Let X be any random variable with variance o*, and define Y = (X — E[X D? 
and œ = e° for some e. Obviously, the conditions of the problem are satisfied 
for Y and g as chosen here. Derive the Chebychev inequality 





2 
P(X- EQ) > 6) < Z. 
€ 


5.15 Show that for a binomial random variable, the mean is given by np and the variance 
is given by np(1 — p). 





5.16 Show that for a Poisson random variable defined by the PMF P(X =k) = Hema, 
where k = 0, 1,2,... and à > 0, we have E(X) = A and VAR(X) = À. 


5.17 Let X denote a Gaussian random variable with a mean equal to zero and a variance 
equal to a”. Show that 


0, n=2k+1 


E[x"]= 
1x3x5x---x(n—Do", n=2k 


x2 
(Hint: Differentiate the identity h e 22 dx = 0.5/2n02, k times.) 


5.18 Two random variables X and Y are distributed according to 


Kœ +y), O<x,y<1 
0, otherwise 


fxy(x, y) = | 


1. Find K. 
—2. Whatis the probability that X + Y > 1? 


3. Find P(X > Y). 

4. Whatis P(X > Y|X +2Y > 1)? 
5. Find P(X =Y). 

6. What is P(X > 0.5|X = Y)? 

7. Find fx (x) and fy(y). 

8 


. Find. fx (x|X + 2Y > 1) and E(X|X + 2Y > 1). 
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5.19 Let X1, X2,..., Xn denote independent and identically distributed random variables 
each with PDF fy (x). 4 


1. fY = min{X1, X2,..., Xn}, find the PDF of Y. 
2. If Z = max{X, X2,..., Xn}, find the PDF of Z. 


5.20 Show that for the Rayleigh density function 


2 
2X 
a ae 


Co 
0, otherwise 
we have E(X) = o /Z and VAR(X) = (2 — §) 0°. 


5.21 Let X and Y be independent random variables with 


fe) ae, x>0 
x)= a 
S 0, otherwise 
and 
= Be, x>0 
Fo) = k otherwise 


where «œ and £ are assumed to be positive constants. Find the PDF of X + Y and treat 
the special case œ = B separately. i 


5.22 Two random variables X and Y are distributed according to 


Ke*"-¥, x>y>0 


0, otherwise ` 


Ixy (x, y) = | 


. Find the value of the constant K. 

. Find the marginal probability density functions of X and Y. 

. Are X and Y independent? 

. Find fxr (rly). 

. Find E(X|Y = y). 

. Find COV(X, Y ) andĝx y. 

5.23 Let © be uniformly distributed on [0, x], and let the random variables X and Y be 


defined by X = cos © and Y = sin ©. Show that X and Y are uncorrelated, but are 
not independent. 


nn A WB NY = 


5.24 Let X and Y be two independent Gaussian random variables, each with mean zerc 
and variance 1. Define the two events E (r) = {X > rand Y > r} and E2(r) = 
{V X2 + Y? > 2r}, wherer is any nonnegative constant. 
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1. Show that E; (r) C Ez(r); therefore, P (E1 (r)) < P (E2(r)). 
2. Show that P (E1(r)) = Q2(r). 


3. Use the relations for rectangular to polar transformation to find P (E2(r)) and 


conclude with the bound > 


e 2 


NI = 


Qr) < 


on the Q-function. 


Pe 


5.25 It can be shown that the Q-function can be well approximated by 


5.26 


5.27 


5.28 


N 


$: 


e T 


Q(x) x F 





(bit + bot? + bat? + bat* + bst’), 


re ee | 
where t = T and 


p = 0.2316419; 
by = 0.31981530; 
by = —0.356563782; 
b3 = 1.781477937; 
b4 = —1.821255978; 
bs = 1.330274429. 
Using this relation, write a computer program to compute the Q-function at any 


given value of its argument. Compute Q(x) for x = 1, 1.5, 2, 2.5, 3, 3.5, 4, 4.5, 5, 
and compare the results with those obtained from the table of the Q-function. 


Let X and Y be independent Gaussian random variables, each distributed according 
to V(0, o?)..Define Z = X +Y and W = 2X — Y. What can you say about the joint 
PDF of Z and W? What is the covariance of Z and W? 


Let X and Y be two jointly Gaussian random variables with means mx and my, 
variances oe and ož, and a correlation coefficient px y. Show that fx (x|y) is a. 
Gaussian distribution with a mean my + pox (y — my) and a variance of (i Py, y) 
What happens if o = 0? What happens if pọ = +1? 


Let X and Y be zero-mean jointly Gaussian random variables, each with variance 
o”. The correlation coefficient between X and Y is denoted by p. Random variables 
Z and W are defined by 


Z = X cos + Y sin0 
W = -X sin0 + Y cos0 `° 


where 0 is a constant angle. 
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1. Show that Z and W are jointly Gaussian random variables. 


2. For what values of @ are the random variables Z and W independent? 


5.29 Two random variables X and Y are distributed according to 


2 


> 2 
nend eanan, 
0, xy <0 

1. Find K. 
2. Show that X and Y are each Gaussian random variables. 
3. Show that X and Y are not jointly Gaussian. 
4. Are X and Y independent? 
5. Are X and Y uncorrelated? 4 
6. Find fxıy(xly). Is this a Gaussian distribution? 


5.30 Let X and Y be two independent Gaussian random variables with a common variance 
a”. The mean of X is m and Y is a zero-mean random variable. We define-random 
variable V as V = /X2 + Y2. Show that 


v mv -tm 0 
=> = 20 > 
oe Ig (z ) e > v 


0, v<0 


1 2x 1 x 
h(x) = zl eX 8S" du = zl eX OS" du 
2x Jo 27 Jx 


is called the modified Bessel function of the first kind and zero order. The distribution 
of V is known as the Rician distribution. Show that, in the special case of m = 0, the 
Rician distribution simplifies to the Rayleigh distribution. 


> 


fv(v) = 


where 


5.31 A coin has the probability of landing on a head equal to } and is flipped 2000 times. 


1. Using the law of large numbers, find a lower bound to the probability that the 
total number of heads lies between 480 and 520. 


2. Using the central limit theorem, find the probability that the total number o! 
heads lies between 480 and 520. 


5.32 Find m x(t) for the random process X (t) given in Example 5.2.3. Is the result inde- 
pendent of t? 


5.33 Let the random process X (t) be defined by X(t) = A + Bt where A and B ar 
independent random variables, each uniformly distributed on [—1, 1]. Find m x(t, 
and Rx (tı, t2). 
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5.34 Show that the process given in Example 5.2.3 is a stationary process. 


5.35 Which one of the following functions can be the autocorrelation function of a random 
process and why? 


1. f(t) = sin(2x fot). 
2. f(t), as shown in Figure P-5.35. 


fC) 





Figure P-5.35 


5.36 Is the process of Example 5.2.5 stationary? 


5.37 A random process Z(t) takes the values 0 and 1. A transition from 0 to 1 or from 1 
to 0 occurs randomly, and the probability of having n transitions in a time interval of 
duration t (t > 0) is given by 


1 at \" 
= =0,1,2,..., 
Pu(n) l+at (Fa) ý 


where œ > 0 is a constant. We further assume that at t = 0, X (0) is equally likely to 
be Oor 1. , 





1. Find mz(t). 
2. Find Rz(t +1, t). Is Z(t) stationary? 
_ 3. Determine the power spectral density of Z(t). 


5.38 The random process X (t) is defined by 
X(t) = X cos 2v fot + Y sin 27 fot, 


where X and Y are two zero-mean independent Gaussian random variables each with 


the variance o”. 
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1. Find mx(t). 
2. Find Ry (¢ +t, t). Is X (t) stationary? 
3. Find the power spectral density of X (t). 


4. Answer Parts 1 and 2 for the case where of = a}. 


7 


5.39 Show that for jointly stationary processes X (t) and Y (t), we have Rxy(t) =Ryx(—7T). 
From this information, conclude that Syy(f) = Sx (f). ; 


5.40 A zero-mean white Gaussian noise process with the power spectral density of o 
passes through an ideal lowpass filter with bandwidth B. 
1. Find the EE of the output process Y(t). 


2. Assuming T = 5> find the joint probability density function of the random 
variables Y (a and Y(t + t). Are these random variables independent? 


5.41 Findthe output autocorrelation function for a delay line withdelay A when the input 
is a stationary process with the autocorrelation Ry (r). Interpret the result. 


5.42 We have proved that when the input to an LTI system is stationary, the output is also 
stationary. Is the converse of this theorem also true? That is, if we know that the 
output process is stationary, can we conclude that the input process is necessarily 
stationary? 


5.43 Generalize the result of Example 5.2.24 when X (t) is stationary. 


1. Show that X (t) and 4 ar X (£) are uncorrelated processes. 


2. Show that the power veer of Z(t) = X(t) + £X (t) is the sum of the power 
spectra of X(f) and 4 7X (6). 


3. Express the power Bo of the sum in terms of the power spectrum of X (t). 


5.44 Assume X(t) is a stationary process with the power spectral density Sx(f). This 
process passes through the system shown in Figure P-5.44. 


1. Is Y(t) stationary? Why? 
2. What is the power spectral density of Y (t)? 


3. What frequency components cannot be present in the output process and why? 


x XO | ost | ¢ | YO 
L veny = T 
Figure P-5.44 
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5.45 The stationary random process X (t) has a power spectral density denoted by Sx (f). 
¢ 


1. What is the power spectral density of Y(t) = X(t) — X(t — T)? 
2. What is the power spectral density of Z(t) = X’ (t) — X(t)? 
3. What is the power spectral density of W(t) = Y(t) + Z(t)? 


5.46 Show that for two jointly stationary processes X (t) and Y (t), we have 
1 
[Rxy(t)| < V Rx(0)Rry (0) < z Rx) + Ry(0)]. 


5.47 The stationary process X (t) is passed through an LTI system, and the output process 
is denoted by Y (t). Find the output autocorrelation function and the cross-correlation 
function between the input and the output processes in each of the following 
cases: 


1. A delay system with delay A. 

2. A system with h(t) = L, 

3. A system with h(t) = e~% u(t), where œ > 0. 
4. A system described by the differential equation 


d d 

—Y(t)+ Y(t) = —X(t) — X(t). 

EP (t) + ¥(@) a (t) — X(t) 
5. A finite-time average defined by the input-output relation 


1 t+T 
O= F F x(t)dr, 


where T is aconstant. 


5.48 Give an example of two processes X (t) and Y (t) for which Rxy (t+, t) is a function 
of t, but X (t) and Y (t) are not stationary. 


Q X(t) denotes a zero-mean WSS Gaussian random process with power spectral 
| density 
| 


aye 
Sx(f) =4 x 10 «10a (zf) 


1. What is the power in this process? 
2. What is the bandwidth of this process? 


3. Assuming that this process passes through an ideal lowpass filter with a band- 
width of 50 kHz and the output is denoted by Y (t), determine Sy (f ), the power 
spectral density of Y (t), and the total power in the output process. 






























































246 Probability and Random Processes Chapter 5 


4. Determine the PDF (probability density function) of the random variable X (0). 
5. Find the smallest value of to > 0 such that X (0) and X (fo) are independent. 


5.50 X(t) denotes a zero-mean WSS Gaussian random process with autocorrelation 
function 
Rx(t) = 4sinc*(10*r) 


. What is the power in this process? 

. Determine the power spectral density, Sy(f), for this process. 

. What is the bandwidth of this process? 

. Assuming that this process passes through an ideal lowpass filter with a band- 
width of 5 kHz and the output is denoted by Y (t), determine Sy (f), the power 
spectral density of Y(t), and the total power in the output process. 

5. Determine the PDF of random variables X (0), X (107%, and X (1.5 x 1074). 
6. Show that random variables X (0) and X(10~*) are independent but X(0) and 

a, X(1.5 x 1074) are dependent. 


hw nN = 


(551 ‘A WSS Gaussian random process X (t) has mean mx = 2 and power spectral density 


j 
f 


ay J 


ZA 107° |f| < 200 
Sx(f) = A 


0 otherwise 


This random process passes through an LTI system shown below. The output of the 
system is denoted by Y(t). 


1. Whatis my, the mean of Y (t)? 

. What is Sy (f), the power spectral density of Y (t)? 

. What is Py, the power in Y (t)? 

. Is Y(t) WSS? Why? 

. Is Y(t) Gaussian? Why? 

. Consider the random variable Y (1), the output at time t = 1. What is the PDF 
of this random variable? —/ 


An bh WN 


a 
dt 


X(t) 
Delay =1 — Y(t) 


Nie 


5.52 In the block diagram shown below, X(t) denotes a zero-mean WSS (wide-sense 
stationary) and white random process with power spectral density Sy(f) = Mo 
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X() 


Y(t)=X 2X' 
ee ere JO 


The block denoted by LPF represents an ideal lowpass filter that passes all frequen- 
cies in the frequency range from — W to W and blocks all other frequencies. Answer 
the following questions, your answers will be in terms of No. 


1. What is the power spectral density and the mean of Y (t)? 
2. What is the power spectral density of Z(t)? 

3. Is Z(t) a WSS random process? Why? 

4. Whatis the variance of Z(t) if W = 4? 

5. What is the power in Y(t)? 


5.53 Find the power spectral density for each of the following processes: 


1. X(t) = Acos(2n fot + ©), where A is a constant and © is a random variable 
uniformly distributed on [0, 4]. 


2. X(t) = X + Y, where X and Y are independent, X is uniform on [—1, 1], and 


{>  Y is uniform on [0, 1]. 

f Ni 

5.54 IX (t) is a stationary random process with the autocorrelation function Ry(t) = 
no ; 


e~l"|, where œ > 0. This process is applied to an LTI system with h(t) = e~P*u(t), 
where 6 > 0. Find the power spectral density of the output process Y (t). Treat the 
cases a 4 B anda = B separately. 


5.55 Let Y(t) = X(t) + M(t), where X(t) and N (t) are signal and noise processes. It 
is assumed that X (t) and N (t) are jointly stationary with the autocorrelation func- 
tions Rx(t) and Ry (T) and cross-correlation function Ryn (T). We want to separate 
the signal from the noise by passing Y (t) through an LTI system with the impulse 
response A(t) and the transfer function H (f). The output process is denoted by X(t), 
which we want to be as close to X (t) as possible. 


1. Find the cross correlation between X (t) and X(t) in terms of h(t), Rx(t), 
Ry (1), and Ryn (T). 
2. Show that the LTI system that minimizes E[X (t)— È (t)]? has a transfer function 


Sx(f) +Sxn(f) 
Sy(f) + Sn (f) + 2Re[Sxn(f)] 


3. Now assume that X (t) and N (t) are independent and N(t) is a zero-mean white 
Gaussian process with the power spectral density %. Find the optimal H(f) 


A(f) = 
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under these conditions. What is the corresponding value of E[X (t) — Xx (t)}* in 
this case? 


4. In the special case of Sv(f) = 1, Sx(f) = ewe and Syyv(f) = 0, find the 


optimal H(f). 


5.56 Inthis problem, we examine the estimation of arandom process by observing another 


random process. Let X (t) and Z(t) be two jointly stationary random processes. We 
are interested in designing an LTI system with the impulse response h(t), such that 
when Z(t) is passed through it, the output process xX (t) is as close to X (t) as possi- 
ble. In other words, we are interested in the best linear estimate of X (t) based on the 
observation of Z(t) in order to minimize E[ X(t) — È OP. 


1. Let us assume we have two LTI systems with the impulse responses A(t) and 


g(t). Z(t) is applied to both systems and the outputs are denoted by X (t) and 
X(t), respectively. The first filter is designed such that its output satisfies the 
condition : 

E RO ~ MZE- r)| =0 


for all values of t and t, whereas the second filter does not satisfy this property. 
Show that . > 
E[X(t) — XP > ELX@) - XP, 


i.e., the necessary and sufficient condition for an optimal filter is that its outpu 
satisfies the orthogonality condition, as given by 


E RO — MZE- r)| =0, 


which simply means that the estimation error e(t) = X(t) — È (t) must be 
orthogonal to the observable process Z (t) at all times. 


2. Show thatthe optimal A(t) must satisfy 


Ryz(t) = Rz(t) x h(t). 


3. Show that the optimal filter satisfies 


Sxz(f) 


say 


4. Derive an expression for E[e*(t)] when the optimal filter is employed. 


5.57 What is the noise equivalent bandwidth of an ideal bandpass filter with bandwidt! 


5.58 A zero-mean white Gaussian noise, n(t), with power spectral density o is passe 


through an ideal filter whose passband is from 3 kHz to 11 kHz. The output proces 
is denoted by n (t). 
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1. If fo = 7 kHz, find Sa. (f), Sns (f), and Rnen, (T), Where n,(t) and ns (t) are the 
in-phase and quadrature components of n(t). 


2. Repeat Part 1 with fo = 6 kHz. 


5.59 Let p(t) be a bandpass signal with the in-phase and quadrature components p,(t) 
and ps(t); let X(t) = Seo A np(t — nT), where A,,’s are independent random 
variables. Express X(t) and X(t) in terms of p,(t) and p,(t). 


5.60 Let X(t) be a bandpass process and let ee Show that for all 
choices of the center frequency fo, V (¢)1émains unchanged. 


5.61 Let n, (t) be a zero-mean white Gaussian noise with the power spectral density M, 
let this noise be passed through an ideal bandpass filter with the bandwidth 2 W 
centered at the frequency fs. Denote the output process by n (t). 


1. Assuming fo = fe, find the power content of ne in- i phage and quadrature com- 
ponents of n(t). ne =S= d No 


V i 
2. Find the density function of V (t), the envelope of n (t). Ne QN (Jz “(VI [ 9 2A Ww ) 
pade, bp 
3. Now assume X(t) = Acos2nfot + n(t), where A is a ‘Constant. What is the 


r obabilit dep şit f aeng of the enve of X 0)? 
o AAR TZ ee as EBA) y B cal A ane) 
Jey = (AHN ENG 
5.62 A noise process has a power pectral Yensity given by 
10-8 (1-4). IfI < 108 oe veh ANT 


0, IfI > 108 


Sn(f) = 


This noise is passed through an ideal bandpass filter with a bandwidth of 2 MHz, 
centered at 50 MHz. 





1. Find the power content of the output process. 210° 


2. Write the output process in terms of the in-phase and quadrature components, 
and find the power in each component., Assume fp = 50 MHz. 


3. Find the power spectral density of the in-phase and quadrature components. 


4. Now assume that the filter is not an ideal filter and is described by 


TNE ANTON ETRY TEE REY AES EMT BP TEE 


|f|—49 x 10° 49MHz < |f| <51 MHz 
0, otherwise i 


IHP = 


Repeat Parts 1, 2, and 3 with this assumption. 
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COMPUTER PROBLEMS 





5.1 Generation of Random Variables 


The objective of this problem is to generate a random variable X that has a linear 
probability density function as shown in Figure CP-5.1 (a); i.e., 





\ Xx O0<x<2 

ie —)2 S432 

| Iae | 0 otherwise 

l | The corresponding probability distribution function is 
| 5 0 x<0 
| F(x) = PAs = | fe@)dv=4% O<x<2. 
| T0 i x>2 








Fy (x) is shown in Figure CP-5.1(b). Note that 0 < Fx(x) < 1. 








+ FQ) | F@) 
1 
i} 
x fi 
ie 4 i Figure CP-5.1 (a) A linear 

i i probability density function 

| — i and (b) the corresponding 

j 0 2 X probability distribution 

(b) function. 





To generate a sequence of samples of the random variable X, say {X;}, we use MAT- 
LAB to generate uniformly distributed random variables {u;} in the range (0, 1). 


Then, we set 


x2 


| F(x) = F 5u 


and solve for x;. Thus, x; = 2,/u;. Clearly, the range of x; is (0, 2). 














1. Generate 10,000 samples of the random variable X by using the procedure pre- 
viously described. 


2. By subdividing the interval (0, 2) into 20 equally spaced subintervals (bins). 
count the number of samples {x;} that fall into each bin and plot a histogram oi 
the 10,000 randomly generated samples. Compare the histogram with the linea 
probability density function fy(x) and comment on how well the histograr 
matches fx (x). 





















































Computer Problems 251 


5.2 Generation of Gaussian Random Variables 


Additive noise encountered in communication systems is characterized by the Gaus- 
sian probability density function 





1 <x? 
fx) = e 2%, —0 <x <, 
V2n0 . 


as shown in Figure CP-5.2(a), where ø? is the variance of the random variable X. 
The probability distribution function, shown in Figure CP-5.2(b), is 


F(x) = PX sx) | fx(v) dv, —o0 <x < œ. 


fœ) ; F(x) 
1 
2 
0 x 0 x 
(a) (b) 


Figure CP-5.2 (a) The Gaussian probability density function and (b) the corresponding probability 
distribution function. i 


The objective of this problem is to generate a sequence {x;} of Gaussian-distributed 
random variables on the computer. If there was a closed-form expression for Fy (x), 
we could easily generate a sequence of uniformly distributed random variables {u;} 
and let Fy (x;) = u;. Then, we solve for x; by performing the inverse mapping. How- 
ever, in this case, Fy (x) cannot be expressed in closed form. Instead, we circumvent 
this problem by first generating random variables that we related to Gaussian ran- 
dom variables. From probability theory, we know that a Rayleigh-distributed random 
variable R, which has a probability distribution function 
r2 
Fg(r) = l—e %27 r>0 
otherwise 


is related to a pair of independent Gaussian random variables X and Y through the 


transformation 
X = RcosO 
Y = RsinO’ 


where © is a uniformly distributed random variable in the interval (0, 2x) and is 
independent from R. The parameter ø? is the variance of X and Y. Therefore, we set 
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Fr(r;) = ui, where {u;} is a sequence of uniformly distributed random variables on 
the interval (0, 1). We can easily invert this equation to obtain 


1 
n= porn ), 
l — ui 


Next, we generate another sequence, say, {v;}, of uniformly distributed random vari- 
ables on the interval (0, 1) and define 





6; = 2mv;. 
Finally, from {r;} and {6;}, we generate the Gaussian random variables using the 
relations 

Xi = ri COS 0; 

yi = r; sind; 


1. By using the preceding procedure, generate 10,000 samples of a Gaussian ran- 
dom variable with a zero mean and a variance o? = 2. 

2. By subdividing the interval (—10, 10) into 20 equally spaced subintervals (bins), 
count the number of samples of {x;} and {y;} that fall into each bin, and plot a 
histogram of the 10,000 randomly generated samples. Compare the histograms 
with the Gaussian probability density function fy (x), and comment on how well 
the histogram matches fx (x). 


5.3 Autocorrelation Function and Power Spectrum 


The objective of this problem is to compute the autocorrelation and the power spec- 
trum of a sequence of random variables. 


1. Generate a discrete-time sequence {x,} of N = 1000 statistically independen 
and identically distributed random variables, which are selected from a uniform 
distribution over the interval (-3, 5). The estimate of the autocorrelation of the 
sequence {x,} is defined as 


N-m 
1 
Rx(m) = N m pe are m=0,1,...,M 
n=l 


N 
1 
= — ’ = —1, —2, ...,—M, 
N — |m] D m 
where M = 100. Compute Rx (m) and plot it. 
2. Determine and plot the power spectrum of the sequence {xn} by computing th 
discrete Fourier transform (DFT) of Rx (m), which is defined as 


bal janfm 
Sx(f) = D> Ramer T, 


m=—M 
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The fast Fourier transform (FFT) algorithm may be used to efficiently compute 
the DFT. 


5.4 Filtered White Noise 


A whiterandom process X (t) with the power spectrum Sx (f) = 1 forall f excites 
a linear filter with the impulse response 


à ee’ t>0 
u= 0 otherwise ` 


1. Determine and plot the power spectrum Sy (f) of the filter output. 


2. By using the inverse FFT algorithm on samples of Sy (f), compute and plot the 
autocorrelation function of the filter output y(t). To be specific, use N = 256 
frequency samples. 


5.5 Generation of Lowpass Random Process 


The objective of this problem is to generate samples of a lowpass random process by 
passing a white-noise sequence {x,} through a digital lowpass filter. The input to the 
filter is a sequence of statistically independent and identically distributed uniform 
random variables on the interval (-4, 5). The digital lowpass filter has an impulse 
response 

(0.9” n>0 


h(n) = 
0 otherwise 


and is characterized by the input-output recursive (difference) equation 
Yn = 0.9Yn-1 + Xn, n=l, y-1=0. 


1. Generate a sequence {x,} of 1000 samples and pass these through the filter to 
generate the sequence {y,}. 


2. Using the basic formula in Computer Problem CP-5.3, compute the autocorre- 
lation functions Rx(m) and Ry(m) for |m| < 100. Plot Ryx(m) and Ry(m) on 
separate graphs. 

3. Compute and plot the power spectrum Sx(f) and Sy(/) by evaluating the dis- 
crete Fourier transform of Rx(m) and Ry(m). 


5.6 Generation of Bandpass Random Process 


A bandpass random process X (t) can be represented as 
X(t) = X-(t) cos 2v fet — X5(t) sin 27 fet, 


where X,(t) and X;(t) are called the in-phase and quadrature components of X (t). 
The random processes X-(t) and X s(t) are lowpass processes. 
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cos 2nf,n 


X(n) 


X(n) = X(n) cos 2xf,n — X,(n) sin 2n fn 


X(n) 


—sin 2nf,n 


Figure CP-5.6 Generation of a bandpass random process. 


1. Generate 1000 samples of a bandpass random process X (t) by first generating 
1000 samples of two statistically independent lowpass random processes X(t) 
and X(t), and then use these to modulate the quadrature carriers cos 21 ft and 
sin 2x fst, as shown in Figure CP-5.6. The carrier frequency is fe = 1000/n. 
The input sequence into the digital lowpass filter has statistically mde pendini 


and identically distributed uniform random variables on the interval (— z> 2) 


2. Use the formula given in Computer Problem CP-5.3 to compute and plot the 
autocorrelations Rx, (m), Rx, (m), and Rx(m) for |m| < 100. 


3. Compute and plot the power spectra of Sx, (f), Sx, (f), and Sx (f) by evaluating 
the discrete Fourier transform of Rx, (m), Rx, (m), and Rx(m). 
















































































































































































Effect of Noise 
on Analog 
Communication 
Systems 


In Chapters 3 and 4, we studied the important characteristics of analog communication sys- 
tems. These characteristics included time domain and frequency domain representations 
of the modulated signal, bandwidth requirements, power content of the modulated sig- 
nal, and the modulator and demodulator implementation of various analog communication 
systems. f 

In this chapter, the effect of noise on various analog communication systems will 
be analyzed. As we will see, angle modulation and frequency modulation (FM) systems, 
in particular, can provide a high degree of noise immunity; therefore, they are desirable 
in cases of severe noise and/or low signal power. This noise immunity is obtained at the 
price of sacrificing channel bandwidth because, as we have seen in Chapter 4, the band- 
width requirements of angle modulation systems are considerably higher than the required 
bandwidth of amplitude modulation (AM) systems. 

This chapter starts with a performance analysis of linear modulation systems in the 
presence of noise. Then, the effect of noise on angle modulation systems is discussed. 
Finally, we analyze the effects of wansmission loss and noise on analog communication 
systems in general. f 


6.1 EFFECT OF NOISE ON AMPLITUDE MODULATION SYSTEMS 


In this section, we determine the signal-to-noise ratio (SNR) of the output of the receiver 
that demodulates the amplitude-modulated signals. In evaluating the effect of noise on the 
various types of analog carrier-modulated signals, it is also interesting to compare the result 
with the effect of noise on an equivalent baseband communication system. We begin the 
evaluation of the effect of noise on a baseband system. 


255 


















































256 Effect of Noise on Analog Communication Systems Chapter 6 
6.1.1 Effect of Noise on a Baseband System 


Since baseband systems serve as a basis for comparison of various modulation systems, 
we begin with a noise analysis of a baseband system. In this case, there is no carrier 
demodulation to be performed. The receiver consists only of an ideal lowpass filter with 
the bandwidth W. The noise power at the output of the receiver, for a white noise input is 
therefore 


= NoW. (6.1.1) 


=) = 2r 6.1.2 
€ » Now E2) 





Example 6.1.1 


Find the SNR in a baseband system with a bandwidth of 5 kHz and with Nọ = 107!4 W/Hz. 
The transmitter power is 1 kilowatt and the channel attenuation is 107??. 


Solution We have Pg = 107!? P} = 10-!2 x 10? = 107° Watts. Therefore, 





S\ — Pr _ 107° -20 
NJ, NoW 1074x5000 ` 
This is equivalent to 10 log;, 20 = 13 dB. a 


6.1.2 Effect of Noise on DSB-SC AM 


In double-sideband suppressed-carrier amplitude modulation (DSB-SC AM), the transmit- 
ted signal is 


u(t) = A,m(t) cos(2n fet); (6.1.3) 


therefore, the received signal at the output of the receiver noise-limiting filter is the sum of 
this signal and filtered noise. Recall from Section 5.3.3 that a filtered noise process can be 
expressed in terms of its in-phase and quadrature components, as given in Equation (5.3.7). 
Adding the filtered noise to the modulated signal, we can express the received signal as 


r(t) = u(t) +n) 
= A,m(t) cos(27 fet) +n,(t) cos(2n fet) — ns(t) sin(2x fet). (6.1.4) 


Suppose we demodulate the received signal by first multiplying r (t) by a locally generated 
sinusoid cos(2x fet +), where @ is the phase of the sinusoid, and then passing the product 
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signal through an ideal lowpass filter having a bandwidth W. The multiplication of r(t) 
with cos(2nf-t +) yields 


r(t)cos(2n fet +) = A,m(t) cos(2n fet) cos(2n fet + Q) + n(t) cos(2n fet + Q) 


= LAm) cos(ġ) + sAcm() cos(4n fet + $) 


1 
+ zO cos(ġ) + ns (t) sin(ġ)] 


+ ; [n,(t) cos(4n fet + Q) —ngs(t)sin(4nfit+)]. (6.1.5) 


The lowpass filter rejects the double-frequency components and passes only the lowpass 
components. Hence, its output is 


1 1 
y(t) = zAem(t) cos(ġ) + 5 [ne (t) cos(@) + ns(t) sin(ġ)]. (6.1.6) 


As was discussed in Chapter 3, the effect of a phase difference between the received 
carrier and a locally generated carrier at the receiver is a drop equal to cos? ($) in the 
received signal power. This can be avoided by employing a phase-locked loop, as will be 
described in Chapter 8. The effect of a phase-locked loop is to generate a sinusoidal carrier 
at the receiver with the same frequency and phase of the received carrier. If a phase-locked 
loop is employed, then ¢ = 0 and the demodulator is called a coherent or synchronous 
demodulator. In our analysis in this section, we assume that we are employing a coherent 
demodulator. With this assumption, without loss of generality, we assume that ¢ = 0; 
hence, 


1 
yt) = 5 [Acm(t) + n-(t)]. (6.1.7) 


Therefore, at the receiver output, the message signal and thenoise components are additive 
and we are able to define a meaningful SNR. The message signal power is given by ` 


2 


A 
Po= q Pm (6.1.8) 


where Py is the power content of the message signal. The noise power is given by 


1 
where we have used the fact that the power contents of ne(t) and n(t) are equal. This was 
discussed, in Equation (5.3.9) in Section 5.3.3. The power content of n(t) can be found by 
noting that it is the result of passing n,,(¢) through a filter with bandwidth Be. Therefore, 
the power spectral density of n(t) is given by 


No 
S.(f)= 10 f= fl <W 


0 otherwise 


(6.1.10) 
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The noise power is 


=2WN. (6.1.11) 


Now we can find the output SNR as 











=. (6.1.12) 


2 
In this case, the received signal power, as given by Equation (3.2.2), is Pr = Ace M There- 


fore, the output SNR in Equation (6.1.12) for DSB-SC AM may be expressed as 





(=) ae (6.1.13) 
N ODSB NoW 


which is identical to (S/N),, which is given by Equation (6.1.2). Therefore, in DSB-SC 
AM, the output SNR is the same as the SNR for a baseband system. In other words, DSB- 
SC AM does not provide any SNR improvement over a simple baseband communication 
system. 


6.1.3 Effect of Noise on SSB AM 


In this case, the modulated signal, as given in Equation (3.2.8), is 
u(t) = Aem(t) cos(2n f-t) F AcM (t) sin(2nf,t). (6.1.14) 
Therefore, the input to the demodulator is 
r(t) = Aem(t) cos(2n fet) F A-m(t) sin(2r fet) + n(t) 
= (Aem(t) + nc(t))cos(27 fet) + (FA-M(t) — ns(t))sin(27 fet). (6.1.15) 


Here we assume that demodulation occurs with an ideal phase reference. Hence, 
the output of the lowpass filter is the in-phase component (with a coefficient of 3) of the 
preceding signal. That is, 


y(t) = feme) + sre(t) (6.1.16) 
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We observe that, in this case again, the signal and the noise components are additive, and a 
meaningful SNR at the receiver output can be defined. Parallel to our discussion of DSB, 
we have 








A2 
P, = q Pm (6.1.17) 
and 
1 1 
Pro = GP ne = GP ns (6.1.18) 
where 
o0 No 
P, = Sa (f)df = z * 2W = WN. (6.1.19) 
—0o 
Therefore, 
S P,  A2Py 
=] = = j 12 
(F). 7 = N 6120 
But in this case, : 
Pr = Py = A2Py; (6.1.21) 
thus, 
S P S 
(5) =- = (=) ; (6.1.22) 
N Josss WNo N Jy 


Therefore, the signal-to-noise ratio in a single-sideband system is equivalent to that of a 
DSB system. f 


6.1.4 Effect of Noise on Conventional AM 


In conventional DSB AM, the modulated signal was given in Equation (3.2.6) as 
u(t) = Ac [1 + am(t)] cos 27 fet. (6.1.23) 
Therefore, the received signal at the input to the demodulator is 
r(t) = [Ac [1 + amn (t)] + nc(t)] cos 2x fet — ns (t) sin 27 fet, (6.1.24) 


where a is the modulation index and m, (t) is normalized so that its minimum value is —1. 
If a synchronous demodulator is employed, the situation is basically similar to the DSB 
case, except that we have 1 + am,(t) instead of m(t). Therefore, after mixing and lowpass 
filtering, we have 
1 

yı) = 3 [Ac [1 + am,(t)] +n-(t)]. (6.1.25) 
However, in this case, the desired signal is m(t), not 1 + am,(t). The DC component in the 
demodulated waveform is removed by a DC block and, hence, the lowpass filter output is 

Ne(t) 


y(t) = 5 Acamy(t) + 2 (6.126) 
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In this case, the received signal power is given by 


A2 
Pr= F [1+a’Pu, |, (6.1.27 


where we have assumed that the message process is zero mean. Now we can derive thi 


output SNR as 
G) 44a Pu, 
N OAM E Pae 
KA A2a’ Py, 
~ 2NoW 


2 
= a? Py, & [1 + a’ Pu, | 
1 +a? Py, NoW 
a? Py, Pr 
1+a*Pu, NoW 


n. a’ Py, S 

~ 1+a@Py, \N/, 
S 

N faao ae 6.1.28 
(5), 


where we have used Equation (6.1.2) and 7 denotes the modulation efficiency. 

We can see that, since a’ Py, <1+ a? Py, the SNR in conventional AM is alway 
smaller than the SNR in a baseband system. In practical applications, the modulation inde: 
a is in the range of 0.8-0.9. The power content of the normalized message process depend 
on the message source. For speech signals that usually have a large dynamic range, Py i 
in the neighborhood of 0.1. This means that the overall loss in SNR, when compared to . 
baseband system, is a factor of 0.075 or equivalent to a loss of 11 dB. The reason for thi 
loss is that a large part of the transmitter power is used to send the carrier component o 
the modulated signal and not the desired signal. 

To analyze the envelope-detector performance in the presence of noise, we must us 
certain approximations. This is a result of the nonlinear structure of an envelope detecto: 
which makes an exact analysis difficult. In this case, the demodulator detects the envelop 
of the received signal and the noise process. The input to the envelope detector is 





r(t) = [Ac [1 + ama(t)] + n-(t)] cos 2x fet — ns (t) sin 27 fet; (6.1.29 


therefore, the envelope of r (t) is given by 


V) = y [Ae [1 + am, (D) +OP +720). (6.1.30 
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Now we assume that the signal component in r (t) is much stronger than the noise compo- 
nent. With this assumption, we have 


P (ne(t) K Ac [1+ am, (1) ~ 1; (6.1.31) 
therefore, we have a high probability that 
V, (t) © Ac [1 + am, (t)] + ne(t). (6.1.32) 
After removing the DC component, we obtain 
y(t) = A,am,(t) + ne(t), (6.1.33) 


which is basically the same as y(t) for the synchronous demodulation without the t coef- 
ficient. This coefficient, of course, has no effect on the final SNR; therefore we conclude 
that, under the assumption of high SNR at the receiver input, the performance of syn- 
chronous and envelope demodulators is the same. However, if the preceding assumption is 
not true, we still have an additive signal and noise at the receiver output with synchronous 
demodulation, but the signal and noise become intermingled with envelope demodulation. 
To see this, let us assume that at the receiver input, the noise power! is much stronger than 
the signal power. This means that 





Ve(t) = IAL + am, (0) + nD + n2(0) 





=, A2(1 + amp (t))? + n2 (t) + n2 (t) + 2Acne(t)(1 + am, (t) 


2Aene(t) 


2 | o20 +40) f + 


2) +2) +20 (+ ama()| 

Achc(t) 
V(t) 

Acn-(t) 
Va (t) 





& Vit) E + (+ ama()| 





= W(t) + (1 +am,(t)), (6.1.34) 
where (a) uses the fact that A2(1 +am,(t)) is small compared with the other components 
and (b) denotes ,/n2(t) + n2(t) by V,(t), the envelope of the noise process; we have also 
used the approximation v1 + € + 1+ a for small €, where 
2Acnc(t ) 
= ————_ (1 + t)). 6.1.35 

€ mw) + nH ° am,(t)) ( ) 
We observe that, at the demodulator output, the signal and the noise components are no 
longer additive. In fact, the signal component is multiplied by noise and is no longer dis- 
tinguishable. In this case, no meaningful SNR can be defined. We say that this system is 
operating below the threshold. The subject of threshold and its effect on the performance 


'By noise power at the receiver input, we mean the power of the noise within the bandwidth of the 
modulated signal or, equivalently, the noise power at the output of the noise-limiting filter. 
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of a communication system will be covered in more detail when we discuss the nois 
performance in angle modulation. 


Example 6.1.2 
We assume that the message is a wide-sense stationary random process M(t) with the auto 


correlation function 
Ry(t) = 16sinc?(10,0007). 


We also know that all the realizations of the message process satisfy the conditio! 
max |m(t)| = 6. We want to transmit this message to a destination via a channel with 

50-dB attenuation and additive white noise with the power spectral density S,,(f) = Mo = 
10-!2 W/Hz. We also want to achieve an SNR at the modulator output of at least 50 dB 
What is the required transmitter power and channel bandwidth if we employ the followin, 
modulation schemes? 


1. DSB AM. 
2. SSB AM. 
3. Conventional AM with a modulation index equal to 0.8. 


Solution First, we determine the bandwidth of the message process. To do this, we obtai 
the power spectral density of the message process, namely, 





Sulf) = FIRu 0) = 759994 (a) ; 


which is nonzero for —10,000 < f < 10,000; therefore, W = 10,000 Hz. Now we ca 
determine (+) , as a basis of comparison: 
Pp Pr = 10° Pr 


(5), “MW 2x10-2x10* 2 
Since the channel attenuation is 50 dB, it follows that 
P. 
10log — = 50; 
Pr 
therefore, 
Pr = 10% Pr. 


Hence, 





S\ _105x10Pr  10Pr 
T 2 2 


1. For DSB-SC AM, we have 
S\ [/S\L 10° Py 
NJ), \N), 2 


10° Pr 


~ 50 dB = 10°. 





Therefore, 
= 10° => Pr = 200 Watts 





and 
BW =2W = 2 x 10,000 = 20,000 Hz =% 20 kHz. 


Section 6.2 Effect of Noise on Angle Modulation 263 


2. For SSB AM, 





S S 103 P; 
saa N EE = =10° Pr = Watt 
( ) (Š), 5 10° => Pr = 200 Watts 


and 
BW = W = 10,000 Hz= 10 kHz. 


3. For conventional AM, witha = 0.8, 


S\ Je 10° Pr 
NJ NN, Ta 


where ņ is the modulation efficiency given by 





a? Py, 


"= 1 +a? Pu, 


` First, we find Py,, the power content of the normalized message signal. Since 
max |m(t)| = 6, we have 


Pu Pu 


Py, = —— = Z. 
Mn — (max mAN? 36 
To determine Py, we have 


Py = Ru (7), = 16; 


therefore, 
Big 16 4 
Mn 36 9 
Hence, 
0.8? x $ S 
o> Ae Ogee A 
Therefore, 





S 10 P 
>) ~0.22—— =0.11 x 10 Pr = 10° 
N), 2 


or 
Pr œ% 909 Watts. 


The bandwidth of conventional AM is equal to the bandwidth of DSB AM, i.e., 
BW = 2W = 20 kHz. a 


6.2 EFFECT OF NOISE ON ANGLE MODULATION 


In this section, we will study the performance of angle-modulated signals when contam- 
inated by additive white Gaussian noise; we will also compare this performance with the 
performance of amplitude-modulated signals. Recall that in amplitude modulation, the 
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(a) 


(b) 


Figure 6.1 Effect of noise in frequency modulation. 


message information is contained in the amplitude of the modulated signal; since noise 
is additive, it is directly added to the signal. However, in a frequency-modulated signal, the 
noise is added to the amplitude and the message information is contained in the frequency 
of the modulated signal. Therefore, the message is contaminated by the noise to the exten 
that the added noise changes the frequency of the modulated signal. The frequency of : 
signal can be described by its zero crossings. Therefore, the effect of additive noise on th: 
demodulated FM signal can be described by the changes that it produces in the zero cross 
ings of the modulated FM signal. Figure 6.1 shows the effect of additive noise on the zer 
crossings of two frequency-modulated signals, one with high power and the other with lov 
power. From the previous discussion and also from Figure 6.1, it should be clear that th 
effect of noise in an FM system is different from that of an AM system. We also observi 
that the effect of noise in a low-power FM system is more severe than in a high-powe 
FM system. In a low-power signal, noise causes more changes in the zero crossings. Thi 
analysis that we present in this chapter verifies our intuition based on these observations. 

The block diagram of the receiver for a general angle-modulated signal is shown i 
Figure 6.2. The angle-modulated signal is represented as” 


u(t) = Ac cos (27 fet + o(t)) 
Ac cos (2m ft +2nk 5 1 ee m(t) dz) FM 


= (6.2.1 
A. cos (21 fet + kpm(t)) PM 


?Throughout our noise analysis, when we refer to the modulated signal, we mean the signal as receive 
by the receiver. Therefore, the signal power is the power in the received signal, not the transmitted power. 
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BW =B, BW=W 5) 


u(t) + nyi) Bandpass r(t) = u(t) +n Angle AO) Lowpass (7 9 
filter demodulator filter 


Figure 6.2 The block diagram of an angle demodulator. 





The additive white Gaussian noise n,,(t) is added to u(t), and the result is passed 
through a noise-limiting filter whose role is to remove the out-of-band noise. The band- 
width of this filter is equal to the bandwidth of the modulated signal; therefore, it passes the 
modulated signal without distortion. However, it eliminates the out-of-band noise; hence, 
the noise output of the filter is a filtered noise process denoted by n(t). The output of this 
filter is 


r(t) =u(t) +n(t) 
=u(t)+n,(t)cos(2nf,t) — ns (t) sinQ2n fet). (6.2.2) 


As with conventional AM noise-performance analysis, a precise analysis is quite involved 
due to the nonlinearity of the demodulation process. Let us assume that the signal power 
is much higher than the noise power. Then, the bandpass noise is represented as (see 


Equation 5.3.8) 
n(t) = ,/n2(t) + n2(t) cos (2s + arctan zt) 


= V,(t) cos (2x fet + ®n(t)), (6.2.3) 





where V, (t) and ®,„ (t) represent the envelope and the phase of the bandpass noise process, 
respectively. The assumption that the signal is much larger than the noise means that 


P(V,(t) K Ac) © 1. (6.2.4) 


Therefore, the phasor diagram of the signal and the noise are as shown in Figure 6.3. 
From this figure, it is obvious that we can write 


Im |. 


iro sin (®,(t) — $(t)) 


D 





r() 





1 
4 


“Vy, costa) = 40) 


/ 


Dalt) — AE) / 


A Pal) 
Re 


Figure 6.3 Phasor diagram of an angle-modulated signal when the signal is much stronger than the noise. 
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r(t) © (Ac + Va(t) cos (p (£) — (2) 


X COS (ans -+ (t) + arctan Vp (t) sin (®, (t) — o(t)) ) 


Ac + Va(t) cos (Pn(t) — 90) 


= (Ac + Vn (t) cos (®,(t) — $(¢))) 
Va (t) 
Ac 


X COS (2x4 + (t) + — sin (®,(t) — 90): (6.2. 


The demodulator processes this signal and, depending on whether it is a phase demodulat< 
or a frequency demodulator, its output will be the phase or the instantaneous frequency < 
this signal (the instantaneous frequency is + times the derivative of the phase).? Therefor 


noting that 
t PM 
sea ey (6.2.4 
2nky f o m(t)dt FM 


we see that the output of the demodulator is given by 











ot) + “ne n sin (®,(t) — o(t)) PM 
yt) = ld 

on dt (ve )+—— ae 2 sin (®, (t) -40)) FM 
kpm(t) + Ml n sin (®,,(t) — d(t)) PM 

7 kpm(t)+ 1a ns sin („n (t) — @(t)) FM 
kpm(t) + Yn (t) PM 

= 1d R (6.2. 
kpm(t) + on gn FM 

where we have defined 
y) £ Y no aW sin (®,(t) -oA . (6.2. 


The first term in Equation (6.2.7) is the desired signal component, and the second ter 
is the noise component. From this expression, we observe that the noise component 
inversely proportional to the signal amplitude A.. Hence, the higher the signal level, tl 
lower the noise level. This is in agreement with the intuitive reasoning presented at tl 
beginning of this section and based on Figure 6.1. Note that this is not the case with amp. 
tude modulation. In AM systems, the noise component is independent of the signal cor 
ponent, and a scaling of the signal power does not affect the received noise power. 


3Of course, in the FM case, the demodulator output is the instantaneous frequency deviation of v(t) fr 
the carrier frequency fe. 
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Let us study the properties of the noise component given by 


Vit) .. 


Y,(t) = = ea (®n(t) — $ (6)) 





= mal (£) sin Pa (t) cos Ø (t) — Vn (t) cos ®, (t) sing(o)| 


= — [ns (t) coso (t) — n-(t) sin p(t). (6.2.9) 


Here we provide an approximate and intuitive argument to determine the signal-to- 
noise ratio of the demodulated signal. A more rigorous derivation can be found in the 
references given at the end of this chapter. 

Note that ¢ (t) is either proportional to the message signal or proportional to its inte- 
gral. In both cases, it is a slowly varying signal compared to ne(t) and ns(t), which are 
the in-phase and quadrature components of the bandpass noise process at the receiver and 
have a much higher bandwidth than ¢ (t). In fact, the bandwidth of the filtered noise at 
the demodulator input is half of the bandwidth of the modulated signal, which is many 
times the bandwidth of the message signal. Therefore, we can say that when we compare 
variations in n,(t) and ns;(t), we can assume that ¢ (t) is almost constant, i.e., P(t) ~ ¢. 
Therefore, 


Y,(t) = 7 [ns(t) cos @ — n-(t) sing]. (6.2.10) 


Now notice thatin this case, fe is the axis of symmetry of the bandpass noise process. 
Therefore, the conditions leading to the result of Example 5.3.3 are valid with a = re 





and b = =t, By using the result of Example 5.3.3, we have 


Sne ( f) 


Sra (f) = (8? + b°)Sn.(f) = = 





(6.2.11) 


where S,,.(f) is the power spectral density of the in-phase component of the filtered noise 
given in Equation (5.3.10). Note that the bandwidth of the filtered noise process extends 
from fe — Bc to fe + že; hence, the spectrum of n,(t) extends from — Be to Be Therefore, 


Be 
N; < 
Sx,.(f) = Sx = °° Asz., (6.2.12) 
0 otherwise 
Substituting Equation (6.2.12) into Equation (6.2.10) results in i 
N Be 
= Ifl<= 
Sy,(f) = 4 Ac 2. (6.2.13) 


0 otherwise 


This equation provides an expression for the power spectral density of the filtered 
noise at the front end of the receiver. After demodulation, another filtering is applied; this 
reduces the noise bandwidth to W, which is the bandwidth of the message signal. Note that 
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in the case of FM, as seen in Equation (6.2.7), the process Y,,(t) is differentiated and scalec 

by +. Using Equation (5.2.25) from Chapter 5, which gives the power spectral density o! 

the derivative of a process, we conclude that the power spectral density of the proces: 
14y (t) is given by (see Equation 5.2.25) 


Qn dt 
No Be 
4n? f? —f* Ifl 
KE SAPS NE R ST. (62.14 
T 0 otherwise 


This means that in PM, the demodulated-noise power spectral density is given by 
Equation (6.2.13); in FM, it is given by Equation (6.2.14). In both cases, Be must bt 
replaced by W to account for the additional postdemodulation filtering. In other words 
for |f| < W, we have 


Fo PM l 
Sn (f) = No : `; (6.2.15 
X f? M 


Figure 6.4 shows the power spectrum of the noise component at the output of the demod 
ulator in the frequency interval |f| < W for PM and FM. 

It is interesting to note that PM has a flat noise spectrum and FM has a paraboli. 
noise spectrum. Therefore, the effect of noise in FM for higher-frequency components i 
much higher than the effect of noise on lower-frequency components. The noise powe 
at the output of the lowpass filter is the noise power in the frequency range [W, +W: 
Therefore, it is given by 


Snol f) | 







No 
Az 


(a) 


No 


ve fI 


-W wW Figure 6.4 Noise power spectrum at 
demodulator output for |f| < W in (a) PM 
(b) and (b) FM. 
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+W 
P, f Sn (P) df 
Ww 





: (6.2.16) 





Now we can use Equation (6.2.7) to determine the output signal-to-noise ratio in 
angle modulation. First, we have the output signal power 

















k2Py PM 
Reno ei (6.2.17) 
kPy FM 
Then the SNR, which is defined as 
S) at Pro 
N}, Pu 
becomes iste 
kA; Pu M 
N), |334 Pu mO E 
2W2 NoW 
Noting that A is the received signal power, denoted by Pr, and 
B, = kp max|m(t)| PM 
kp max |m(¢| (6.2.19) 
= ———— FM 
By wW 
we may express the output SNR as 
2 
P 
o elea) aa ™ 
(=) = maximal No (6.2.20) 
NJ, F zg) Py 
R\ max |m()|) NoW 


If we denote ney by (2) p the SNR of a baseband system with the same received power, 
we obtain j 
Pub; 


ie zG) PM 
(Š) _ | (max|m@)|)? \N/, l (6.2.21) 


N r Pub? () 
(max |m(t|)? \N b 
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Note that in the preceding expression, mx tape is the average-to-peak power ratio oj 
the message signal (or equivalently, the power content of the normalized message, Py, ) 
Therefore, 


S 
sy [Beu (=) ™ 
(=) = $f A (6.2.22 
o 3 2p pa 
Py Pata ©) oe 


Now using Carson’s rule Be = 2(6 + 1)W, we can express the output SNR in terms of th 
bandwidth expansion factor, which is defined as the ratio of the channel bandwidth to thi 
message bandwidth and is denoted by Q: 
def Bo _ 
Q= Ta 2(B + 1). (6.2.23 


From this relationship, we have B = g — 1. Therefore, 


Q 2 
—-1 S 
2 
Pu max [mE] ($) PM 
S 
($) = a4 (6.2.24 


—-] S 
2 = 
3Pm ATO] (3), FM 
From Equations (6.2.20) and (6.2.24), we can make several observations: 


1. In both PM and FM, the output SNR is proportional to the square of the modulatic 
index £. Therefore, increasing B increases the output SNR, even with low receive 
power. This is in contrast to amplitude modulation, where such an increase in tt 
received SNR is not possible. 

2. The increase in the received SNR is obtained by increasing the bandwidth. Therefor 
angle modulation provides a way to trade off bandwidth for transmitted power. 

3. The relation between the output SNR and the bandwidth expansion factor Q is 
quadratic relation. This is far from optimal.* Information theoretical analysis of tł 
performance of communication systems shows that the optimal relation between tl 
output SNR and the bandwidth expansion ratio is an exponential relation. 


4. Although we can increase the output SNR by increasing £, having a large B mea 
having a large B. (by Carson’s rule). Having a large Be means having a large noi 
power at the input of the demodulator. This means that the approximation P(V,,(¢) - 
Ac) © 1 will no longer apply and that the preceding analysis will not hold. In fact, 
we increase B such that the preceding approximation does not hold, a phenomen 
known as the threshold effect will occur and the signal will be lost in the noise. TI 
means that although increasing the modulation index up to a certain value improv 


‘By optimal relation, we mean the maximum savings in transmitter power for a given expansion in bai 
width. An optimal system achieves the fundamental limits on communication, as predicted by information thec 
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the performance of the system, this cannot continue indefinitely. After a certain point, 
increasing 6 will be harmful and deteriorates the performance of the system. 

5. A comparison of the preceding result with the SNR in amplitude modulation shows 
that, in both cases, increasing the transmitter power (and consequently the received 
power) will increase the output SNR, but the mechanisms are totally different. In 
AM, any increase in the received power directly increases the signal power at the 
output of the demodulator. This is basically because the message is in the ampli- 
tude of the transmitted signal and an increase in the transmitted power directly 
affects the demodulated signal power. However, in angle modulation, the message 
is in the phase of the modulated signal and, consequently, increasing the transmitter 
power does not increase the demodulated message power. In angle modulation, the 
output SNR is increased by a decrease in the received noise power, as seen from 
Equation (6.2.16) and Figure 6.1. 

6. In FM, the effect of noise is higher at higher frequencies. This means that signal 
components at higher frequencies will suffer more from noise than signal compo- 
nents at lower frequencies. In some applications where FM is used to transmit SSB- 
FDM (frequency-division multiplexing) signals, those channels that are modulated 
on higher-frequency carriers suffer from more noise. To compensate for this effect, 
such channels must have a higher signal level. The quadratic characteristics of the 
demodulated noise spectrum in FM is the basis of preemphasis and deemphasis fil- 
tering, which will be discussed later in this chapter. 


Example 6.2.1 


What is the required received power in an FM system with 6 = 5 if W = 15 kHz and 
No = 1074 W/Hz? The power of the normalized message signal is assumed to be 0.1 Watt 
and the required SNR after demodulation is 60 dB. 


Solution We use the relation 


s Pr 
2 ee eee 
($) = 38 Pu ay 


with ($), = 106, 8 =5, Py, = 0.1, No = 107'4, and W = 15,000, to obtain Pr = 2 x 10-5 
or 20 microwatts. m 


6.2.1 Threshold Effect in Angle Modulation 


The noise analysis of angle-demodulation schemes is based on the assumption that the 
signal-to-noise ratio at the demodulator input is high. With this crucial assumption, we 
observe that the signal and noise components at the demodulator output are additive and we 
are able to carry out the analysis. This assumption of high SNR is a simplifying assumption 
that is usually made in the analysis of nonlinear modulation systems. Due to the nonlinear 
nature of the demodulation process, the additive signal and noise components at the input 
of the modulator do not result in additive signal and noise components at the output of 
the demodulator. In fact, this assumption is generally not correct. The signal and noise 
processes at the output of the demodulator are completely mixed in a single process by 
a complicated nonlinear relationship. Only under the high SNR assumption is this highly 
nonlinear relationship approximated as an additive form. Particularly at low SNRs, signal 
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and noise components are so intermingled that we cannot recognize the signal from the 
noise; therefore, no meaningful SNR as a measure of performance can be defined. 

In such cases, the signal is not distinguishable from the noise and a mutilation o 
threshold effect is present. There exists a specific SNR at the input of the demodulato 
(known as the threshold SNR) below which signal mutilation occurs. The existence of thr 
threshold effect places an upper limit on the trade-off between bandwidth and power iy 
an FM system. This limit is a practical limit in the value of the modulation index B,- 
The analysis of the threshold effect and the derivation of the threshold index By is quit 
involved and beyond the scope of our analysis. The references cited at the end of this boo! 
can provide an analytic treatment of the subject. This text will only mention some result 
on the threshold effect in FM. 

Atthreshold, the following approximate relation between Ne = (3) , and 8p hold 
in an FM system: 


S 
=) =20(6;+1). (6.2.25 
(gm 


From this relation, given a received power Pr, we can calculate the maximum allowed , 
to make sure that the system works above threshold. Also, given a bandwidth allocation < 
B-, we can find an appropriate 6 using Carson’s rule Be = 2(6 + 1)W. Then, using th 
preceding threshold relation, we determine the required minimum received power to mak 
the whole allocated bandwidth usable. 

In general, there are two factors that limit the value of the modulation index £. Tr 
first is the limitations on channel bandwidth, which affect 6 through Carson’s rule. Tt 
second is the limitation on the received power which limits the value of £ to less than tt 
value derived from Equation (6.2.25). Figure 6.5 shows plots of the output SNR in an FI 
system as a function of the baseband SNR. The output SNR values in these curves are : 
decibels, and different curves correspond to different values of 6, as marked. The effect « 
threshold is apparent from the sudden drops in the output SNR. These plots are drawn fi 
a sinusoidal message for which 


Py 1 
(max m(t)? 2° 


Siopa 
ORON ws 


As an example, for 6 = 5, this relation yields 


S 
= 15.7 — 
x a). 


dB 


(6.2.2 


In such a case, 


(6.2.2 








dB 


and 


(5) =120~208 dB. (6.2.2 
b,th 
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DSB, SSB 





Decibel scale 


Decibel scale ( S ) 
b 


Figure 6.5 Output SNR of an FM system as a function of the baseband SNR for various values of £. 


For B = 2, we have 


S 
=7.8 + ($) ; (6.2.30) 
N o| 
dB 


S 
(Š) = 60 ~ 17.8 dB. (6.2.31) 
N b,th 





Itis apparent that if, e.g., (5) b= 20 dB, then regardless of the available bandwidth, 
we cannot use 8 = 5 for such a system because the system will operate below threshold. 
For this case, we can use B = 2. This yields an SNR equal to 27.8 dB at the output of the 
receiver. This is an improvement of 7.8 dB over a baseband system. 

In general, if we want to employ the maximum available bandwidth, we must choose 
the largest possible £ that guarantees that the system will operate above threshold. This is 


the value of £ that satisfies 5 
(=) = 20(6 + 1). (6.2.32) 
N bth 


By substituting this value into Equation (6.2.22), we obtain 
S 
(=) = 606°(6 + 1) Pm,» (6.2.33) 
N o 


which relates a desired output SNR to the highest possible £ that achieves that SNR. 
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Example 6.2.2 
Design an FM system that achieves an SNR at the receiver equal to 40 dB and-requires the 
minimum amount of transmitter power. The bandwidth of the channel is 120 kHz; the message 
bandwidth is 10 kHz; the average-to-peak power ratio for the message, Pu, = mao is 
53 and the (one-sided) noise power spectral density is No = 1078 W/Hz. What is the requirec 
transmitter power if the signal is attenuated by 40 dB in transmission through the channel? 


Solution First, we have to see whether the threshold or the bandwidth impose a more restric: 
tive bound on the modulation index. By Carson’s rule, 


B: = 0B +1)W 
120,000 = 2(6 + 1) x 10,000, 
from which we obtain 8 = 5. Using the relation 
S 2 
(3). = 606°(B + 1) Pm, (6.2.34 


with (+), = 10*, we obtain £ ~ 6.6. Since the value of £ given by the bandwidth constrair 
is less than the value of 6B given by the power constraint, we are limited in bandwidth (a 
opposed to being limited in power). Therefore, we choose B = 5, which, when substituted i 
the expansion for the output SNR, 


S\ 3/8 k 
ORON ws 


(5) = 800 L 266.6 ~ 24.26 dB. (6.2.34 
Vie 3 


yields 


Since (Š) p= _ with W = 10,000 and No = 10-8, we obtain 


8 
Pr = — = 0.0266 ~ —15.74 dB 6.2.3 
R 300 ( 
and 
Pr = —15.74 + 40 = 24.26 dB ~ 266.66 Watts. (6.2.3 


Had there been no bandwidth constraint, we could have chosen 6 = 6.6, which would rest 
in (3), =~ 153. In turn, we would have Pg © 0.0153 and Pr ~ 153 Watts. 


6.2.2 Preemphasis and Deemphasis Filtering for FM 


As observed in Figure 6.4, the noise power spectral density at the output of the FM demo 
ulator has a parabolic shape within the message signal bandwidth. This parabolic increa 
in the noise power spectral density is due to the use of a differentiator in the FM demo 
ulator. As a consequence, the higher-frequency components of the message signal a 
degraded by this increase in noise power. 

To compensate for the increase in noise power at the higher frequencies of the me 
sage signal, we can boost the high-frequency components prior to the FM modulator at t 
transmitter and, thus, transmit these frequencies at a higher power level. This can be eas: 
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2 =i 
IH, (f)| = fire 





I [l 2 
-5f -4f -3h -2 h 0 f fo 3f 4% hh f 


Figure 6.6 Preemphasis and 
deemphasis filter characteristics. 


-5f —4% -3o -2b fo 9 fo fo 3h 4 So f 


accomplished by using a highpass filter at the transmitter, called a preemphasis filter. Thus, 
the degradation of the high-frequency components of the message due to the large noise 
power spectral density at the demodulator is reduced. 

Having boosted the high frequencies of the message signal at the wansmitter, we 
need to restore these frequency components in the demodulated message signal to their 
original form. This is accomplished by performing the inverse operation of preemphasis, 
i.e., passing the demodulated signal through a lowpass filter, called a deemphasis filter. 
Thus, the cascade of the preemphasis and the deemphasis filters has reciprocal frequency- 
response characteristics within the bandwidth occupied by the message signal, as shown in 
Figure 6.6. j 

The characteristics of the preemphasis and deemphasis filters depend largely on 
the power spectral density of the message process. In commercial FM broadcasting of 
music and voice, first-order lowpass and highpass RC filters with a time constant of 75 ws 
are employed. In this case, the frequency response of the receiver (deemphasis) filter is 
given by 


1 


where fo = oT ~ 2100 Hz is the 3 dB frequency of the filter. 

To analyze the effect of preemphasis and deemphasis filtering on the overall signal- 
to-noise ratio in FM broadcasting, we note that since the transmitter and the receiver 
filters cancel the effect of each other, the received power in the message signal remains 
unchanged, and we only have to consider the effect of filtering on the received noise. Of 
course, the only filter that has an effect on the received noise is the receiver filter, which 
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shapes the power spectral density of the noise within the message bandwidth. The noise 
component before filtering has a parabolic power spectrum. Therefore, the noise compo- 
nent after the deemphasis filter has a power spectral density given by 


Snep lf) = Sno PH Aa PP 


-Mpa 1l 


= (6.2.40) 
AY 146% 
0 





where we have used Equation (6.2.15). The noise power at the output of the demodulator 
can be obtained as 


+W 
Papp =f Snpp (f) af 


N +W 2 
=z, apa 
eJ-w 1+5 








3 W 
= aci E — arctan z] . (6.2.41, 
A Lf fo 


Since the demodulated message signal power in this case is equal to that of a simple FM 
system with no preemphasis and deemphasis filtering, the ratio of the output SNRs in thes: 
two cases is inversely proportional to the noise power ratios, i.e., 





S 
(A) op aa Pro 
s = — 
(š), Pm 
2Now? 
E 342 
~ Nof [w Ww 
mar arctan fo 


1 fo 
3 W. — arctan W’ 


(6.2.42 


where we have used Equation (6.2.16). Hence, Equation (6.2.42) gives the improvemer 
obtained by employing preemphasis and deemphasis filtering. 


Example 6.2.3 
In commercial FM broadcasting, W = 15 kHz, fo = 2100 Hz, and £ = 5. Assuming that tt 
average-to-peak power ratio of the message signal is 0.5, find the improvement in the outp 
SNR of FM when we use preemphasis and deemphasis filtering rather than a baseband systen 
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Solution From Equation (6.2.22), we have 


Z) =3x 5? x 0.5 x (5) 
NJ, N), 
S 
=37.5 (5), 
S 
zx 15.7 + (5), 


Therefore, FM with no preemphasis and deemphasis filtering performs 15.7 dB better than a 
baseband system. For FM with preemphasis and deemphasis filtering, we have 


3 

WwW 
(3) ae Mr es 
NJ» 3% —arctan® \N /, 


15,000 \3 
_1__ (ior) sS 
3 15,000 15,000 N o 


— arctan 2100 


(6.2.43) 





dB 














dB 


S 
~ 13.3 +15.7 + ( = 
(x) 


w29+ (> 
x a 


The overall improvement when using preemphasis and deemphasis filtering rather than a base- 
band system is, therefore, 29 dB. | 


b 





dB 


(6.2.44) 





dB 


6.3 COMPARISON OF ANALOG-MODULATION SYSTEMS 


Now, we are at a point where we can present an overall comparison of different analog 
communication systems. The systems that we have studied include linear modulation sys- 
tems (DSB-SC AM, conventional AM, SSB-SC AM, vestigial sideband) and nonlinear 
modulation systems (FM and PM). 

The comparison of these systems can be done from various points of view. Here we 
present a comparison based on three important practical criteria: 


1. The bandwidth efficiency of the system. 


2. The power efficiency of the system as reflected in its performance in the presence of 
noise. 


3. The ease of implementation of the system (transmitter and receiver). 
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Bandwidth Efficiency. The most bandwidth efficient analog communication sys- 
tem is the SSB-SC system with a transmission bandwidth equal to the signal bandwidth. 
This system is widely used in bandwidth critical applications, such as voice wansmission 
over microwave and satellite links and some point-to-point communication systems in con- 
gested areas. Since SSB-SC cannot effectively transmit DC, it cannot be used for the trans- 
mission of signals that have a significant DC component, such as image signals. A good 
compromise is the VSB system, which has a bandwidth slightly larger than SSB and is 
capable of transmitting DC values. VSB is used in TV broadcasting and in some data 
communication systems. PM, and particularly FM, are the least favorable systems when 
bandwidth is the major concem, and their use is only justified by their high level of noise 
immunity. 


Power Efficiency. A criterion for comparing the power efficiency of various sys- 
tems is the comparison of their output signal-to-noise ratio at a given received signal 
power. We have already seen that angle-modulation schemes, and particularly FM, pro- 
vide a high level of noise immunity and, therefore, power efficiency. FM is widely used 
on power-critical communication links, such as point-to-point communication systems and 
high-fidelity radio broadcasting. It is also used for transmission of voice (which has been 
already SSB/FDM multiplexed) on microwave line-of-sight and satellite links. Conven- 
tional AM and VSB-+C are the least power-efficient systems and are not used when the 
transmitter power is a major concern. However, their use is justified by the simplicity of 
the receiver structure. 


Ease of Implementation. The simplest receiver structure is the receiver for con- 
ventional AM, and the structure of the receiver for VSB+C system is only slightly more 
complicated. FM receivers are also easy to implement. These three systems are widely used 
for AM, TV, and high-fidelity FM broadcasting (including FM stereo). The power ineffi- 
ciency of the AM transmitter is compensated by the extremely simple structure of literally 
hundreds of millions of receivers. DSB-SC and SSB-SC require synchronous demodula- 
tion and, therefore, their receiver structure is much more complicated. These systems are. 
therefore, never used for broadcasting purposes. Since the receiver structure of SSB-SC anc 
DSB-SC have almost the same complexity and the transmitter of SSB-SC is only slightly 
more complicated than DSB-SC, DSB-SC is hardly used in analog signal transmission 
due to its relative bandwidth inefficiency. 


6.4 EFFECTS OF TRANSMISSION LOSSES AND NOISE IN ANALOG COMMUNICATION 
SYSTEMS 


In any communication system, there are usually two dominant factors that limit the perfor 
mance of the system. One important factor is additive noise that is generated by electroni: 
devices that are used to filter and amplify the communication signal. A second factor tha 
affects the performance of a communication system is signal attenuation. Basically al 
physical channels, including wireline and radio channels, are lossy. Hence, the signal i 
attenuated (reduced in amplitude) as it travels through the channel. A simple mathematica 
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Transmitted Channel Received 
signal signal 
—— 
s(t) r(t) = as(t) + n(t) 





Figure 6.7 Mathematical model of 
Attenuation Noise channel with attenuation and additive 
a n(t) noise. 


model of the attenuation may be constructed, as shown in Figure 6.7, by multiplying the 
transmitted signal by the factor œ < 1. Consequently, if the transmitted signal is s(t), the 
received signal is 


r(t) =as(t) + n(t). (6.4.1) 


Clearly, the effect of signal attenuation is to reduce the amplitude of the desired signal 
s(t) and, thus, to render the communication signal more vulnerable to additive noise. 

In many channels, such as wirelines and microwave line-of-sight channels, signal 
attenuation can be offset by using amplifiers to boost the level of the signal during trans- 
mission. However, an amplifier also introduces additive noise in the process of amplifica- 
tion and, thus, corrupts the signal. This additional noise must be taken into consideration 
in the design of the communication system. 

In this section, we consider the effects of attenuation encountered in signal transmis- 
sion through a channel and additive thermal noise generated in electronic amplifiers. We 
also demonstrate how these two factors influence the design of a communication system. 


6.4.1 Characterization of Thermal Noise Sources 


Any conductive two-terminal device is generally characterized as lossy and has some resis- 
tance, say R ohms. A resistor that is at a temperature T above absolute zero contains free 
electrons that exhibit random motion and, thus, result in a noise voltage across the terminals 
of the resistor. Such a noise voltage is called thermal noise. 

In general, any physical resistor (or lossy device) may be modeled by a noise source 
in series with a noiseless resistor, as shown in Figure 6.8. The output n(t) of the noise 


Figure 6.8 A physical resistor (a) is modeled as a noiseless resistor in series with a 
(a) (b) noise source (b). 
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n(t) 


Figure 6.9 Noisy resistor connected to a load resistance Rz. 


source is characterized as a sample function of a random process. Based on quantun 
mechanics, the power spectral density of thermal noise (see Section 5.3) is given as 


2Rh\f | 


Sr(f) = (#-1) (volts)* /Hz, (6.4.2 


erT — 


where h is Plank’s constant, k is Boltzmann’s constant, and T is the temperature of th 
resistor in degrees Kelvin, i.e., T = 273 + C, where C is in degrees Centigrade. A 
indicated in Section 5.3, at frequencies below 10!2 Hz (which includes all conventiona 
communication systems) and at room temperature, 


erT ~1+——. (6.4.3 


Consequently, the power spectral density is well approximated as 
Sr(f) =2RkT (volts)*/Hz. (6.4.4 


When connected to a load resistance with value Rz, the noise voltage shown i 
Figure 6.9 delivers the maximum power when R = RL. In such a case, the load is matche 
to the source and the maximum power delivered to the load is E[N z (t)]/4RL. Therefor 
the power spectral density of the noise voltage across the load resistor is 


S,(f) = z W/EZ. (6.4. 


As previously indicated in Section 5.3.2, kT is usually denoted by No. Hence, the powt 
spectral density of thermal noise is generally expressed as 


Sa (f) = = W/Hz. (6.4.1 
For example, at room temperature (Tọ = 290° K), No = 4 x 10-7! W/Hz. 
6.4.2 Effective Noise Temperature and Noise Figure 
When we employ amplifiers in communication systems to boost the level of a signal, v 
are also amplifying the noise corrupting the signal. Since any amplifier has some fini 


passband, we may model an amplifier as a filter with the frequency-response characterist 
H(f). Let us evaluate the effect of the amplifier on an input thermal noise source. 
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Thermal 
noise > 
source 


Amplifier 


H(f) : i 
Figure 6.10 Thermal noise 
Matched Matched connected to amplifier and load. 


—— Load 






Figure 6.10 illustrates a thermal noise source connected to a matched two-port net- 
work having the frequency response H (f). The output of this network is connected to a 
matched load. First, we recall that the noise power at the output of the network is 


[oe] N foe) 
Pro= f SED =F f EPR at (6.4.7) 


From Section 5.3.3, we recall that the noise equivalent bandwidth of the filter is defined as 
1 se 2 

Breq = lH(f)I df, (6.4.8) 
2G Joo 


where, by definition, = |H (f) |Z ax is the maximum available power gain of the amplifier. 
Consequently, the output noise power from an ideal amplifier that introduces no additional 
noise may be expressed as 


Pro = G NoBneq: (6.4.9) 


Any practical amplifier introduces additional noise at its output due to internally 
generated noise. Hence, the noise power at its output may be expressed as 


Pao = G No Bneq + Pri 
= GT Beg + Pris | (6.4.10) 


where Pai is the power of the amplifier output due to internally generated noise. Therefore, 





Pri 
Pro = GKBneq ( T All 
i a( n zsz) Ca) 


This leads us to define a quantity 


Pri 


T: = ——, 6.4.12 
G KBneq ERE 


which we call the effective noise temperature of the two-port network (amplifier). Then 
Pro = GRBueg(T + Te). (6.4.13) 


Thus, we interpret the output noise as originating from a thermal noise source at the tem- 
perature T + Te. 

A signal source at the input to the amplifier with power P,; will produce an output 
with power 


Pig = G Pii: (6.4.14) 
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Hence, the output SNR from the two-port network is 





Syo Pee G Pj 
ii Gr ~ Pao YKT Breq(1 + Te/T) 
i r Psi 
| ~ NoBreq(1 + T/T). 


1 S 
SaN (5), (6.4.15) 


where, by definition, (S/N); is the input SNRto the two-port network. We observe that the 
SNR at the output of the amplifier is degraded (reduced) by the factor (1 + T/T). Thus, T, 
is a measure of the noisiness of the amplifier. An ideal amplifier is one for which T, = 0. 
| When T is taken as room temperature Tp (290° K), the factor (1+T,/Tp) is called the 
noise figure of the amplifier. Specifically, the noise figure of a two-port network is defined 
j j as the ratio of the output noise power Pno to the output noise power of an ideal (noiseless) 
hi two-port network for which the thermal noise source is at room temperature (T = 290° K). 
Clearly, the ratio 








F= (1 4 z) (6.4.16 
To 





is the noise figure of the amplifier. Consequently, Equation (6.4.15) may be expressed as 


(=) == ($) l (6.4.17 
N), F\WN), 


By taking the logarithm of both sides of Equation (6.4.17), we obtain 





S S , 
10 log (=) = —10log F + 10log (=) ; (6.4.18 
N/o N/; 
Hence, 10log F represents the loss in SNR due to the additional noise introduced by thx 
amplifier. The noise figure for many low-noise amplifiers, such as traveling wave tubes, i 
below 3 dB. Conventional integrated circuit amplifiers have noise figures of 6 dB to 7 dB. 
It is easy to show (see Problem 6.16) that the overall noise figure of a cascade of K 
amplifiers with gains &, and corresponding noise figures Fk, 1 < k < K is 
Fy-1  F3-1 Fer -1 
2 pa E E eee eS (6.4.19 
G A BAH... Gg- 
This expression is known as Fries’ formula. We observe that the dominant term is Fi 
| which is the noise figure of the first amplifier stage. Therefore, the front end of a receive 
Ni should have a low noise figure and a high gain. In that case, the remaining terms in the sur 
will be negligible. 


F=F,+ 

















a Example 6.4.1 

Suppose an amplifier is designed with three identical stages, each of which has a gain c 
l @, = 5 and a noise figure F; = 6, i = 1, 2,3. Determine the overall noise figure of th 
i cascade of the three stages. 






























































Section 6.4 Effects of Transmission Losses € 283 


Solution From Equation (6.4.19), we obtain 


Paa 
G QQ’ 


where Fi = Fy = F = 6 and Y = % = 5, Hence, 


F=F + 





F: =6+1+0.2 =7.2, 


or, equivalently, Figg = 8.57 dB. L] 


6.4.3 Transmission Losses 


As we indicated previously, any physical channel attenuates the signal transmitted through 
it. The amount of signal attenuation generally depends on the physical medium, the fre- 
quency of operation, and the distance between the transmitter and the receiver. We define 
the loss £ in signal transmission as the ratio of the input (transmitted) power to the output 
(received) power of the channel, i.e., 


Pr 
Q€ = —, 6.4.20 
Pr ( ) 
or, in decibels, 
Las = 10log E = 10log Pr — 10log Pr. (6.4.21) 


In wireline channels, the transmission loss is usually given in terms of decibels per unit 
length, e.g., dB/km. For example, the transmission loss in coaxial cable of 1 cm diameter 
is about 2 dB/km at a frequency of 1 MHz. This loss generally increases with an increase 
in frequency. 


Example 6.4.2 
Determine the transmission loss for a 10 km and a 20 km coaxial cable if the loss per kilometer 
is 2 dB at the frequency operation. 


Solution The loss for the 10 km channel is £43 = 20 dB. Hence, the output (received) 
power is Pg = Pr/L = 10-* Pr. For the 20 km channel, the loss is £33 = 40 dB. Hence, 
Pr = 10~* Pr. Note that doubling the cable length increases the attenuation by two orders of 
magnitude. a 


In line-of-sight radio systems, the transmission loss is given as 


2 
f= (=) ; (6.4.22) 


where à = c/f is the wavelength of the transmitted signal, c is the speed of light (3 x 
108 m/sec), f is the frequency of the transmitted signal, and d is the distance between the 
transmitter and the receiver in meters. In radio wansmission, £ is called the free-space path 
loss. 
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Example 6.4.3 
Determine the free-space path loss for a signal transmitted at f = 1 MHz over distances o 
10 km and 20 km. 


Solution The loss given in Equation (6.4.22) for a signal at a wavelength A = 300 m is 


Lag = 201log,9(4n x 10*/300) 
= 52.44 dB (6.4.23 


for the 10 km path and 


Laz = 20l0g,o (87 x 10/300) 
= 58.44 dB (6.4.24 


for the 20 km path. It is interesting to note that doubling the distance in radio transmissio; 
increases the free-space path loss by 6 dB. ! 


Example 6.4.4 
A signal is transmitted through a 10 km coaxial line channel, which exhibits a loss of 2 dB/kir 
The transmitted signal power is Pigg = —30 dBW (—30 dBW means 30 dB below 1 Watt o; 
simply, one milliwatt). Determine the received signal power and the power at the output of a 
amplifier that has a gain of Gp = 15 dB. 


Solution The transmission loss for the 10 km channel is £a = 20 dB. Hence, the receive 
signal power is 


Prag = Prag — Lag = —30 — 20 = —50 dBW. (6.4.25 


The amplifier boosts the received signal power by 15 dB. Hence, the power at the output c 
the amplifier is 


Pogp = Prag + Gap 
= —50+ 15 = —35 dBW. (6.4.2¢€ 


6.4.4 Repeaters for Signal Transmission 


Analog repeaters are basically amplifiers that are generally used in telephone wire-lin 
channels and microwave line-of-sight radio channels to boost the signal level and, thus, t 
offset the effect of signal attenuation in transmission through the channel. 

Figure 6.11 illustrates a system in which a repeater is used to amplify the signal th: 
has been attenuated by the lossy transmission medium. Hence, the input signal power ; 
the input to the repeater is 


Transmitter |Pr 
Pr 


Pr = Pr/&. (6.4.2° 


Ciosy | Pe= Sampie | (i 
Lossy Pr= F _ | Amplifier | W4 Figure 6.11 A communication 
channel =| SaF, | system employing a repeater to 
£ compensate for channel loss. 
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The output power from the repeater is 
Po = G PR = Ba Pr/£. (6.4.28) 


We may select the amplifier gain G, to offset the transmission loss. Hence, %, = £ and 
Po = Pr. 
Now, the SNR at the output of the repeater is 





iy Pr \_1( Pr ) 
~ F, NoBneq z Fa LNoBneq 
1 Pr 
eer ; 6.4.2 
Fag (sae) ( A 


Based on this result, we may view the lossy transmission medium followed by the amplifier 
as a cascade of two networks: one with a noise figure £ and the other with a noise figure 
F,. Then, for the cascade connection, the overall noise figure is 


Fal 








F=£+ (6.4.30) 


a 


If we select %, = 1/Y, then 





F=£ gi = Fa. (6.4.31) 


1 E 
Hence, the cascade of the lossy transmission medium and the amplifier is equivalent to a 
single network with the noise figure £F. 
Now, suppose that we transmit the signal over K segments of the channel, where 
each segment has its own repeater, as shown in Figure 6.12. Then, if F; = £; Fai is the 
noise figure of the ith section, the overall noise figure for the K sections is 


L Fn -1 £3F 3 —1 
Pilati. get tee ey 
Fá Gai /Ly (Gai /L1)( Ga2/L2) 
L£xFax —1 





(6.4.32) 


Channel 
Ly 


ECAA (Gax/L£x) 


Transmitter Lee Channel bg Repeater | p Channel Ae h 
Pr £ Gar, Fa $ Eg Fa 






Repeater 
San» F, an 


all 


Figure 6.12 A communication system employing repeaters. 
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Therefore, the signal-to-noise ratio at the output of the repeater (amplifier) at th 
receiver is 





1 Pr ) 
== ; (6.4.33 
F Gas 


In the important special case where the K segments are identical, i.e., £; = £ fo 
alli and Fa; = Fa for all i, and where the amplifier gains are designed to offset the losse 
in each segment, i.e., Gai = $; for all i, then the overall noise figure becomes 


F = KEF, — (K — 1) ~ KEF. (6.4.34 


S 1 P 
(=) N ( : | (6.4.35 
NJo KEF, \NoBneg 


Therefore, the overall noise figure for the cascade of the K identical segments is simply } 
times the noise figure of one segment. 


Hence, 








Example 6.4.5 

A signal with the bandwidth 4 kHz is to be transmitted a distance of 200 km over a wirelin 
channel that has an attenuation of 2 dB/km. (a) Determine the transmitter power Pr require 
to achieve an SNR of (S/N), = 30 dB at the output of the receiver amplifier that has 
noise figure Faag = 5 dB. (b) Repeat the calculation when a repeater is inserted every 1 
km in the wireline channel, where the repeater has a gain of 20 dB and a noise figure c 
F, = 5 dB. Assume that the noise equivalent bandwidth of each repeater is Byeq = 4 kHz an 
that No = 4 x 107°! W/Hz. 


Solution 


1. The total loss in the 200 km wireline is 400 dB. From Equation (6.4.35), with K = 1, w 


ai 10log(S/N) = —10 log £ — 10 log F, — 10 log(NoBneq) + 10log Pr. 
Hence, 
Pras = (S/N )oag + Faas + (NoBneg)ag + 10log £ 
= 30 + 5 + 400 + (NoBneq)aB- 
But 


(No Bneq)as = 10log(1.6 x 1077) = —168 dBW, 
where dBW denotes the power level relative to 1 Watt. Therefore, 


Pras = 435 — 168 = 267 dBW 
Pr = 5 x 10% Watts, 


which is an astronomical figure. 
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2. The use of a repeater every 10 km reduces the per segment loss to Lag = 20 dB. There are 
20 repeaters and each repeater has a noise figure of 5 dB. Hence, Equation (6.4.35) yields 


(S/N)oas = —10log K — 10 log £ — 10 log F, — 10 log(No Breg) + 10 log Pr 


and 
30 = —13 — 20 — 5 + 168 + Pras- 
Therefore, 
Prag = —100 dBW, 
or equivalently, 


Pr = 107! Watts (0.1 picowatts). a 


The preceding example clearly illustrates the advantage of using analog repeaters in 
communication channels that span large distances. However, we also observed that analog 
repeaters add noise to the signal and, consequently, degrade the output SNR. It is clear 
from Equation (6.4.35) that the transmitted power Pr must be increased linearly with the 
number K of repeaters in order to maintain the same (S/N), as K increases. Hence, for 
every factor of 2 increase in K, the transmitted power Pr must be increased by 3 dB. 





- 6.5 SUMMARY AND FURTHER READING 


In this chapter, we determined the effects of additive noise in the demodulation of AM, 
FM, and PM analog signals. We began by evaluating the signal-to-noise ratio (SNR) of 
the received signal at the output of the demodulator for DSB-SC AM, SSB AM, and con- 
ventional AM. In Example 6.1.2, we compared these three types of AM signals on the 
basis of the wansmitted power required to achieve an output SNR of 50 dB in additive 
white Gaussian with power spectral density of 107!? W/Hz and an attenuation of 50 dB. 
We found that the required transmitter power for DSB-SC and SSB is 200 Watts and, for 
conventional AM, it is 909 Watts. 

We performed a similar evaluation of the output SNR in angle-modulated (FM and 
PM) systems, and we observed the important role played by the modulation index 8, which 
provides a measure of the bandwidth expansion‘in angle-modulated systems. We demon- 
strated that the channel bandwidth required to transmit the angle-modulated signal is given 
by Carson’s rule as Be = 2(8 + 1) W, where W is the bandwidth of the information-bearing 
signal. We also considered the threshold effect in the demodulation of angle-modulated 
signals and described the signal distortion that occurs when the received signal SNR falls 
below a precomputed threshold. To compensate for high-frequency noise enhancement that 
occurs in the demodulation of FM signals, we inwoduced preemphasis filtering at the trans- 
mitter in order to boost the high frequencies of the transmitted signal. At the receiver end, 
a deemphasis filter is used following the FM demodulator in order to restore the FM signal 
to its original form and thus undo the preemphasis filtering performed at the transmitter. 
Angle-modulated systems are compared with amplitude-modulated systems on the basis 
of bandwidth efficiency, power efficiency, and ease of implementation. 
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In the last section of this chapter, we characterized thermal noise sources in term 
of this effective noise temperature and noise figure. Finally, we considered transmissio 
losses and the benefits of inserting analog repeaters in long-distance signal transmission. 

Analysis of the effect of noise on analog communication systems can be found i 
many textbooks on communications, including Carlson (1986), Ziemer and Tranter (2002' 
Couch (1993), and Gibson (1993). The book of Sakrison (1968) provides a detailed anal 
ysis of FM in the presence of noise. Taub and Schilling (1986) provide in-depth treatmer 
of the effect of threshold and various methods for threshold extension in FM. 


PROBLEMS 


6.1 Thereceived signal r (t) = s (t) +n (t) ina communication system is passed throug 
an ideal lowpass filter (LPF) with bandwidth W and unity gain. The signal compo 
nent s (t) has a power spectral density 


2. 
1 + (F/B)? 


where B is the 3 dB bandwidth. The noise component n (t) has a power spectra 
density No/2 for all frequencies. Determine and plot the SNR as a function of th 
ratio W/B. What is the filter bandwidth W that yields a maximum SNR? 


S (f) = 


6.2 The input to the system shown in Figure P-6.2 is the signal plus noise waveform 
r (t) = Ac cos 2n fet +a (t), 


where z (t) is a sample function of a white noise process with spectral density No/2 


1. Determine and sketch the frequency response of the RC filter. 
2. Sketch the frequency response of the overall system. 


3. Determine the SNR at the output of the ideal LPF assuming that W > fe wher 
W denotes the bandwidth of the LPF. Sketch the SNR as a function of W fo 


fixed values of R and C. 
C 
if i PS, 
i nN 
r(t) R Output 





Figure P-6.2 
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6.3 A DSB amplitude-modulated signal with a power spectral density, as shown in Fig- 


ure P-6.3(a), is corrupted with additive noise that has a power spectral density No/2 
within the passband of the signal. The received signal plus noise is demodulated and 
the lowpass is filtered, as shown in Figure P-6.3(b). Determine the SNR at the output 
of the LPF. 





A(f) 

Sml F) j 1 
Poj -W wW F 
r(t) LPF Output 

E-W fo tet WF aad 
cos (2nf,t) 
(a) (b) 
Figure P-6.3 
6.4 A certain communication channel is characterized by a 90 dB attenuation and addi-. 


6.5 


tive white noise with the power spectral density of 4° = 0.5 x 107! W/Hz. The 
bandwidth of the message signal is 1.5 MHz, and its amplitude is uniformly dis- 
tributed in the interval [—1, 1]. If we require that the SNR after demodulation be 
30 dB, find the necessary transmitter power in each of the following cases: 


1. Upper single-sideband (USSB) modulation. 
2. Conventional AM with a modulation index of 0.5. 


3. DSB-SC modulation. 


A sinusoidal message signal, whose frequency is less than 1000 Hz, modulates the 
carrier c(t) = 107? cos2n fect. The modulation scheme is conventional AM and the 
modulation index is 0.5. The channel noise is additive white noise with a power 
spectral density of m% = 107!? W/Hz. At the receiver, the signal is processed, as 
shown in Figure P-6.5 (a). The frequency response of the bandpass noise-limiting 
filter is shown in Figure P-6.5 (b). 


1. Find the signal power and the noise power at the output of the noise-limiting 
filter. 


2. Find the output SNR. 
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H(f) 
` Bandpass 
roe ——>| noise-limiting 
E filter 
2 cos 2nf,t 
(a) 
H(f) 
—>| 2000 Hz |< 1 —>| 2000 Hz H- 






=f; f.—1500 f.  f.+1500 F 
(b) 


Figure P-6.5 


6.6 In an analog communication system, demodulation gain is defined as the ratio of th 
SNR at the output of the demodulator to the SNR at the output of the noise-limitin; 
filter at the receiver front end. Find expressions for the demodulation gain in each o 
the following cases: 


1. DSB. 
2. SSB. 


3. Conventional AM with a modulation index of a. What is the largest possibl 
demodulation gain in this case? 


4. FM with a modulation index Br. 
5. PM with a modulation index >. 


6.7 In a broadcasting communication system, the transmitter power is 40 kW, the chan 
nel attenuation is 80 dB, and the noise power spectral density is 10-!° W/Hz. Th 
message signal has a bandwidth of 104 Hz. 


. Find the predetection SNR (SNR in r(t) = au(t) + n(t)). 
. Find the output SNR if the modulation is DSB. 
. Find the output SNR if the modulation is SSB. 


. Find the output SNR if the modulation is conventional AM with a modulatio 
index of 0.85 and has a normalized message power of 0.2. 


hwnd = 


6.8 A communication channel has a bandwidth of 100 kHz. This channel is to be use 
for transmission of an analog source m(t), where |m(t)| < 1, and its bandwidth i 
W =4 kHz. The power content of the message signal is 0.1 Watt. 
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6.9 


6.10 


m(t) 
mt) 
m(t) 


mx(t) 


1. Find the ratio of the output SNR of an FM system that utilizes the whole band- 
width to the output SNR of a conventional AM system with a modulation index 
of a = 0.85. What is this ratio in dB? 


2. Show that if an FM system and a PM system are employed and these systems 
have the same output SNR, then 


BWpm _ v36 +1 
BWem Bp+1 ` 





The normalized message signal m,„(t) has a bandwidth of 5000 Hz and power of 0.1 
Watt, and the channel has a bandwidth of 100 kHz and attenuation of 80 dB. The 
noise is white with a power spectral density 0.5 x 10~!* W/Hz, and the wansmitter 
power is 10 kW. 


1. If AM with a = 0.8 is employed, what is ($)? 
2. If FM is employed, what is the highest possible (5)? 


A normalized message signal has a bandwidth of W = 8 kHz and a power of Py, = 
L, We must transmit this signal via a channel with an available bandwidth of 60 
kHz and attenuation of 40 dB. The channel noise is additive and white with a power 
spectral density of zt = 107! W/Hz. A frequency modulation scheme, with no 
preemphasis/deemphasis filtering, has been proposed for this purpose. 


1. If we want an SNR of atleast 40 dB at the receiver output, what is the minimum 
required transmitter power and the corresponding modulation index? 


2. If the minimum required SNR is increased to:60 dB, how would your answer 
change? 

3. If in Part 2, we are allowed to employ preemphasis or deemphasis filters with a 
time constant of t = 75 usec, how would the answer change? 


In transmission of telephone signals over line-of-sight microwave links, a combina- 
tion of FDM-SSB and FM is often employed. A block diagram of such a system is 
shown in Figure P-6.11. 


Each of the signals m;(t) is bandlimited to W Hz, and these signals are USSB mod- 
ulated on carriers c;(t) = A; cos 27 feit, where feae = (i — LW, 1 <i < K, and 
m(t) is the sum of all USSB-modulated signals. This signal FM modulates a carrier 
with frequency fe with a modulation index of £. 


Figure P-6.11 
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1. Plot a typical spectrum of the USSB-modulated signal m (t). 
2. Determine the bandwidth of m(t). 


3. At the receiver side, the received signal r(t) = u(t) + ny(t) is first FM demod. 
ulated and then passed through a bank of USSB demodulators. Show that the 
noise power entering these demodulators depends on i. 


4. Determine an expression for the ratio of the noise power entering the demodu. 
lator, whose carrier frequency is f; to the noise power entering the demodulato: 
with the carrier frequency f;,1 <i, j < K. 


5. How should the carrier amplitudes A; be chosen to guarantee that, after USSE 
demodulation, the SNR for all channels is the same? 


6.12 A power meterthat measures average power is connected to the output of a transmit 
ter, as shown in Figure P-6.12. The meter reading is 20 Watts when it is connectec 
to a 50 Q load. Determine 


1. The voltage across the load resistance. 
2. The current through the load resistance. 


3. The power level in dBm units. 


Figure P-6.12 


6.13 A twisted-pair telephone wireline channel with characteristic impedance Ze = 300 $ 
is terminated with a load of Z; = 300 Q. The telephone line is 200 km long and ha. 
a loss of 2 dB/km. 


1. If the average wansmitted power Pr = 10 dBm, determine the received powe 
Pe if the line contains no repeaters. 


2. If repeaters with a gain of 20 dB are used to boost the signal on the channel, am 
if each repeater requires an input signal level of 10 dBm, determine the numbe 
of repeaters and their spacing. The noise figure of each repeater is 6 dB. 


6.14 A radio antenna pointed in a direction of the sky has a noise temperature of 50° K 
The antenna feeds the received signal to the preamplifier, which has a gain of 35 d} 
over a bandwidth of 10 MHz and a noise figure of 2 dB. 

1. Determine the effective noise temperature at the input to the preamplifier. 


2. Determine the noise power at the output of the preamplifier. 
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6.15 An amplifier has a noise equivalent bandwidth Byeq = 25 kHz and a maximum 
available power gain of % = 30 GB. Its output noise power is 10°87, where To 
denotes the ambient temperature. Determine the effective noise temperature and the 
noise figure. 


6.16 Prove that the effective noise temperature of k two-port networks in cascade is 
Ta 13 rae Tek 
Gi Gh GR Ge 


Using this relationship, prove Fries’ formula, which is given in Equation (6.4.19). 








COMPUTER PROBLEMS 


6.1 Effect of Noise on DSB-SC AM 


The transmitted message signal is given by 


sinc(100t) O<t<t 
m(t) = bs 
0 otherwise 
where fo = 0.1. The message signal modulates the carrier c(t) = cos 21 fet, where 
J: = 250 Hz, to produce the DSB-SC AM signal u(t) = m(t)c(t). 


1. By selecting the sampling interval t; = 0.0001, generate 1000 samples of the 
message signal m (t) and the modulated signal u(t) att = nts,n = 0, 1,..., 999, 
and plot both signals. 


2. Generate a sequence of 2000 zero-mean and unit-variance Gaussian random 
variables. Form the received signal sequence, 


r(nts) = r (n) = u(nts) +o [we(nts) cos 27t fonts — ws(nts) sin 27 fents] 
= u(n) +o [w-(n) cos 27 fents — ws(n) sin 27 fonts], 


where w,(t) and ws(t) represent the quadrature components of the additive 
Gaussian noise process and ø? is a scale factor that is proportional to the noise 
power. Generate and plot the received signal sequence {7(n)} for the following 
values of o: o = 0.1, o = 1, and o =2. 


3. Demodulate the received signal sequence {r (n)} by using the demodulator shown 
in Figure CP-6.1, and plot the received message signal m, (t) for each of the 
three values of o. The lowpass filter is a linear phase FIR filter, which has 31 
taps, a cutoff frequency (—3 dB) of 100 Hz, and a stopband attenuation of at 
least 30 dB. Comment on the effect of the additive noise on the demodulated 
signal m, (t) by comparing m,(t) with the transmitted message signal m(t). 


4. Determine the SNR at the receiver output for the three values of ø. 


















































294 Effect of Noise on Analog Communication Systems Chapter ¢ 


r(t) m,(t) 


Lowpass filter 


cos 2nf,t 
t= nton =O, 1, ... 999 Figure CP-6.1 Demodulator for DSB-SC AM. 


6.2 Effect of Noise on SSB-SC AM 
Repeat Computer Problem 6.1 when the transmitted signal is an SSB-SC AM. 
6.3 Effect of Noise on Conventional AM 


A message signal is given by 


sinc(100r) O0<t <%t 
0 otherwise 


where tọ = 0.1. The message signal modulates the carrier c(t) = cos 27 fet, wher: 
fe = 250 Hz, to produce a conventional AM signal u(t). The modulation index i 
a=0.8. 


1. By selecting the sampling interval ts = 0.0001, generate 1000 samples of th 
message signal m(t) and the modulated signal 


u(t) = [1 + am(t)]cos 2r fetl, 


. =nts, n=0,1,...,999° 
Plot both signals. 

2. Generate a sequence of 2000 zero-mean and unit-variance Gaussian randor 
variables. Form the received signal sequence, 

r(nts) =r(n) = u(nt,) + o [w-(nt,) cos 2n fonts — ws(nt,) sin 27 fonts] 
= u(n) +o [w,(n) cos 27 fonts — ws(n) sin 27 fonts], 

where w-(t) and ws(t) represent the quadrature components of the additiv 
Gaussian noise process and a” is a scale factor that is proportional to the nois 


power. Generate and plot the received signal sequence {r (n)} for the followin 
values of o: o =0.1,o = 1, and o = 2. 


3. Demodulate the received signal sequence {r(n)} by using an envelope detectc 
that computes 


e(t) = y1 +am(t) + we(t)}? + w(t) |, ats n=0,1,2,..999" 


Plot e(t) for each of the three values of o. Compare e(t) with the original met 
sage signal m(t), and comment on the effect of the additive noise on the democ 
ulated signal. Note that in the absence of noise, the message signal m(t) can b 
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obtained from the envelope signal e(t) by subtracting the DC offset, which is 
equal to 1. In the presence of noise, what is the DC offset? 


4. Determine the SNR at the receiver output for the three values of ø. 
6.4 Effect of Noise on FM 


The message signal 
sinc(100t) 0 <t < to 
m(t) = Pais 
0 otherwise 


where to = 0.1, frequency modulates the carrier c(t) = cos 27 fet, in which f, = 
250 Hz. The frequency-deviation constant is kz = 100. The frequency-modulated 


signal is 
t 


u(t) = cos (2xre + 20k f 


—00 


m(t) ar) f 


1. By selecting the sampling interval t, = 0.0001, generate 2000 samples of the 
message signal m(t) and its integral. Plot them on separate graphs. 


2. Generate and plot 2000 samples of the FM signal u(t) in the time interval 
It] < to. 


3. Use MATLAB’s Fourier-transform routine to compute and plot the spectra of 
m(t) and u(t) on separate graphs. 


4. Generate a sequence of 3998 zero-mean and unit-variance Gaussian random 
variables. Form the received signal sequence 


r(nts) = r (n) = u(nts) to [we (nts) cos 27 fents — ws(nts) sin 27 fents] 
= u(n) + o [we(n) cos 2x fonts — ws(n) sin 27 fents] 


for n = 0, +1, 2, .. . , £999, where w-(t) and ws(t) represent the quadrature 
components of the additive Gaussian noise process and ø? is a scale factor that is 
proportional to the noise power. Generate and plot the received signal sequence 
{r(n)} foro = 0.1 and o = 1. 


5. Demodulate the received signal sequence {r (n)} to obtain the received message 
signal m,(t). Compare the result with the original message signal. The FM signal 
can be demodulated by first finding the phase of u(t), i.e., the integral of m(t), 
which can be differentiated and divided by 21k to yield m(t). Use the MAT- 
LAB function unwrap . m to undo the effect of 2 phase foldings. Comment on 
how well the demodulated signal matches the original signal m (t). 
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Communication systems are designed to transmit information. In any communication sys 
tem, there exists an information source that produces the information; the purpose of th 
communication system is to transmit the output of the source to the destination. In radi 
broadcasting, for instance, the information source is either a speech source or a musi 
source. In TV broadcasting, the information source is a video source that outputs a movin 
image. In FAX transmission, the information source produces a still image. In communica 
tion between computers, either binary data or ASCII characters are transmitted; therefore 
the source can be modeled as a binary or ASCII source. In storage of binary data on 
computer disk, the source is again a binary source. 

In Chapters 3, 4, and 6, we studied the transmission of analog information usin 
different types of analog modulation. The rest of this book deals with transmission c 
digital data. Digital data transmission provides a higher level of noise immunity, mor 
flexibility in the bandwidth—power trade-off, the possibility of applying cryptographic an 
antijamming techniques, and the ease of implementation using a large-scale integratio 
of circuits. In order to employ the benefits of digital data transmission, we have to fir: 
convert analog information into digital form. Conversion of analog signals into digital dat 
should be carried out with the goal of minimizing the signal distortion introduced in th 
conversion process. 

In order to convert an analog signal to a digital signal, i.e., a stream of bits, thre 
operations must be completed. First, the analog signal has to be sampled, so that we ca 
obtain a discrete-time continuous-valued signal from the analog signal. This operation i 
called sampling. Then the sampled values, which can take an infinite number of value: 
are quantized, i.e., rounded to a finite number of values. This is called the quantizatio 
process. After quantization, we have a discrete-time, discrete-amplitude signal. The thir 
He stage in analog-to-digital conversion is encoding. In encoding, a sequence of bits (one 
i! and zeros) are assigned to different outputs of the quantizer. Since the possible outputs ¢ 
ih. the quantizer are finite, each sample of the signal can be represented by a finite number ¢ 
bits. For instance, if the quantizer has 256 = 2° possible levels, they can be represented b 
it 8 bits. 
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-7.1 SAMPLING OF SIGNALS AND SIGNAL RECONSTRUCTION FROM SAMPLES 


The sampling theorem is one of the most important results in the analysis of signals; 
it has widespread applications in communications and signal processing. This theorem 
and its numerous applications clearly show how much can be gained by employing the 
frequency-domain methods and the insight provided by frequency-domain signal analysis. 
Many modern signal-processing techniques and the whole family of digital communication 
methods are based on the validity of this theorem and the insight provided by it. In fact, 
this theorem, together with results from signal quantization techniques, provide a bridge 
that connects the analog world to digital communication techniques. 


7.1.1 The Sampling Theorem 


The idea leading to the sampling theorem is very simple and quite intuitive. Let us assume 
that we have two signals, xı (t) and x2(t), as shown in Figure 7.1. The first signal xı (t) 
is a smooth signal, which varies very slowly; therefore, its main frequency content is at 
low frequencies. In contrast, x2(t) is a signal with rapid changes due to the presence of 
high-frequency components. We will approximate these signals with samples taken at reg- 
ular intervals Tı and T2, respectively. To obtain an approximation of the original signal, we 
can use linear interpolation of the sampled values. It is obvious that the sampling interval 
for the signal x (t) can be much larger than the sampling interval necessary to reconstruct 
signal x2(¢) with comparable distortion. This is a direct consequence of the slow time vari- 
ations of the signal x; (t) compared to x2(t). Therefore, the sampling interval for the signals 
of smaller bandwidth can be made larger, or the sampling frequency can be made smaller. 
The sampling theorem is a precise statement of this intuitive reasoning. It basically states 
two facts: 


1. If the signal x(t) is bandlimited to W, i.e., if X(f) = O0 for |f| = W, then it is 


sufficient to sample at intervals T, = a 


x(t) 





(b) Figure 7.1 Sampling of signals. 
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2. If we are allowed to employ more sophisticated interpolating signals than linear inter 
polation, we are able to recover the exact original signal from the samples, as lon; 
as condition 1 is satisfied. 


Obviously, the importance of the sampling theorem lies in the fact that it not only provide 
a method to reconstruct the original signal from the sampled values, but also gives a pre 
cise upper bound on the sampling interval (or equivalently, a lower bound on the samplin; 
frequency) required for distortionless reconstruction. 


Sampling Theorem. Let the signal x(t) have a bandwidth W, i.e., let X(f) =| 
for |f| > W. Let x(t) be sampled at multiples of some basic sampling interval Ts, wher: 
Ts < sk, to yield the sequence {x (nTs)} 2 ... Then it is possible to reconstruct the origina 


s ow? n=—O0O 
signal x(t) from the sampled values by the reconstruction formula 
co 
x(t) = > 2W’T,x(nT,)sinc[2W'(t — nT;)], (7.1.1 
n=—00 


where W’ is any arbitrary number that satisfies the condition 
,_1 
W<W<-—-W. 
Ts 


In the special case where T; = z> we will have W’ = W = T and the reconstructio. 


relation simplifies to 
co t 
x(t) = D x (nT, )sinc (= — n) : 


n=—0OO 
Proof. Let xs(t) denote the result of sampling the original signal by impulses at n7 
time instants. Then 


x(t) = ys x(nT,)5(t — nT,). (7.1.2 
We can write os 
x(t) = x(t) J. êl nT), (7.1.3 


n=—00 


where we have used the property that x(t)ô(t — nT;) = x(nT;)ô(t — nT,). Now if w 
find the Fourier transform of both sides of the preceding relation and apply the dual of th 
convolution theorem to the right-hand side, we obtain 


xn =x al De seat]. (7.1.4 


n=—oo 


Using Table 2.1 to find the Fourier transform of ads (t —nT,), we obtain 


a| > se =n» == 3 s(7-Z). (1.1. 


n=—00 n=—00 





’ 
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Substituting Equation (7.1.5) into Equation (7.1.4), we obtain 


1 foe) 
AND De s(¢-2) 


sd x(¢-F), (7.1.6) 


where, in the last step, we have employed the convenes property of the impulse sig- 
nal, which states X (f) » ô ( f- x)= =X ( f — +). This relation shows that X¿( f), the 
Fourier wansform of the impulse-sampled ae is a replication of the Fourier tanio 
of the original signal at a rate of 7- 1 Figure 7.2 shows a plot of X3(f). Now if T, > sw? 
then the replicated spectrum of x 6) overlaps and reconstruction of the original signal is not 
possible. This type of distortion, which results from undersampling, is known as aliasing 
error or aliasing distortion. However, if T; < sw? no overlap occurs; and by employing an 
appropriate filter we can reconstruct the original signal. To get the original signal back, it is 
sufficient to filter the sampled signal through a lowpass filter with the frequency-response 
characteristics l 


1. H(f) =T, for |f| <W. 
2. H(f) =Ofor|f| > 7 W. 


For W < |f| < + — W, the filter can have any characteristic that makes its implementation 
easy. Of course, o obvious (though not practical) choice is an ideal lowpass filter with 
bandwidth W’, where W’ satisfies W < W’ < T — W, i.e., by using a filter with a transfer 


function given by ; 
HA=7|] (5) . (1.1.7) 


` 


X(f) 








4 1 
T, T, 


Figure 7.2 Frequency-domain representation of the sampled signal. 
























































300 Analog-to-Digital Conversion ` Chapter ; 


With this choice, we have 





X(f) =È] (55) (7.1.8 


Taking the inverse Fourier transform of both sides, we obtain 


x(t) = x(t) x 2W’T,sinc(2W’t) 


( XO x(a)5(t — nT) * 2W'T,sinc(2W't) 
XO 2W’T,x(nT,)sinc (2W' (t — nT;)) . (7.1.9 


n=—0o 


This relation shows that if we use sinc functions for interpolation of the sampled val 
ues, we can perfectly reconstruct the original signal. The sampling rate f, = Ww is thi 
minimum sampling rate at which no aliasing occurs. This sampling rate is known a 
the Nyquist sampling rate. If sampling is done at the Nyquist rate, then the only choic: 
for the reconstruction filter is an ideal lowpass filter and W’ = W = T In this case, 


x(t) = 2 (y) sinc (2Wt — n) 
= 2 era (+ = n) B“ CAN 


In practical systems, sampling is done at a rate higher than the Nyquist rate. Thi 
allows for the reconstruction filter to be realizable and easier to build. In such caseg, th 
distance between two adjacent replicated spectra in the frequency domain, i.e., (Ł —W)- 
W = fs — 2W, is known as the guard band. Therefore, in systems with a guad band, w 
have fs = 2W + Wg, where W is the bandwidth of the signal, Wg is the guard band, anı 
J; is the sampling frequency. 

Note that there exists a strong similarity between our development of the samplin: 
theorem and our previous development of the Fourier transform for periodic signals (o 
Fourier series). In the Fourier transform for periodic signals, we started with a (time) peri 
odic signal and showed that its Fourier transform consists of a sequence of impulses. There 
fore, to define the signal, it was enough to give the weights of these impulses (Fourier-serie 
coefficients). In the sampling theorem, we started with an impulse-sampled signal, or 
sequence of impulses in the time domain and showed that the Fourier transform is a peri 
odic function in the frequency domain. Here again, the values of the samples are enoug 
to define the signal completely. This similarity is a consequence of the duality between th 
time and frequency domains and the fact that both the Fourier-series expansion and th 
reconstruction from samples are orthogonal expansions, one in terms of the exponentiz 
signals and the other in terms of the sinc functions. 
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Example 7.1.1 


In this development, we have assumed that samples are taken at multiples of Ts. What happens 
if we sample regularly with T, as the sampling interval, but the first sample is taken at some 
0< t< T? 


Solution We define a new signal y(t) = x(t + to). Then y(t) is bandlimited with Y (f) = 
e/?*fto X(f) and the samples of y(t) at {kT } -œ are equal to the samples of x(t) at {to + 
kT} _.9: Applying the sampling theorem to the reconstruction of y(t), we have 


foe) 


y(t) = J. ykT,)sinc (2W(t — kT;)) 
k=—00 
foe) 
= J x(q +kT,)sine (2W(t — kT;)) ; 
k=—00 
hence, 
foe) ° 
x(t +to) = D> x(to +kT,)sinc (2W(t — kT,)). 
. k=—00 . 
Substituting t = —to, we obtain the following important interpolation relation: 
oo . 
x(0) = a X(to + kT,)sinc (2W (to + kT;)) « (7.1.11) 
k=- j 
C 
Example 7.1.2 


A bandlimited signal has a bandwidth equal to 3400 Hz. What sampling rate should be used 
to gűarantee a guard band of 1200 Hz? 
Solution We have 

fs =2W + We; 


therefore, 
° fs = 2 x 3400 + 1200 = 8000. a 


After sampling, the continuous-time signal is transformed to a discrete-time signal. 
In other words, the time-axis has been quantized. After this step, we have samples taken 
at discrete times, but the amplitude of these samples is still continuous. The next step in 
analog-to-digital conversion is the quantization of the signal amplitudes. This step results 
in'a signal that is quantized in both time and amplitude. 





7.2 QUANTIZATION 


After sampling, we have a discrete-time signal, i.e., a signal with values at integer multiples 
of T;. The amplitudes of these signals are still continuous, however. Transmission of real 
numbers requires an infinite number of bits, since generally the base 2 representation of 
real numbers has infinite length. After sampling, we will use quantization, in which the 
amplitude becomes discrete as well. As a result, after the quantization step, we will deal 
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with a discrete-time, finite-amplitude signal, in which each sample is represented by 
finite number of bits. In this section, we will study different quantization methods. W 
begin with scalar quantization, in which samples are quantized individually; then, we wi 
explore vector quantization, in which blocks of samples are quantized at a time. 


7.2.1 Scalar Quantization 


In scalar quantization, each sample is quantized into one of a finite number of levels, whic 
is then encoded into a binary representation. The quantization process is a rounding prc 
cess; each sampled signal point is rounded to the “nearest” value from a finite set of po: 
sible quantization levels. In scalar quantization, the set of real numbers R is partitione 
into N disjoint subsets denoted by Rg, 1 < k < N (each called a quantization region 
Corresponding to each subset Rg, a representation point (or quantization level) x, is ch 
sen, which usually belongs to Ng. If the sampled signal at time i, x; belongs to Rg, the 
it is represented by £x, which is the quantized version of x. Then, X, is represented t 
a binary sequence and transmitted. This latter step is called encoding. Since there are , 
possibilities for the quantized levels, log, N bits are enough to encode these levels in: 
binary sequences.! Therefore, the number of bits required to transmit each source outp 
is R = log, N bits. The price that we have paid for representing (rounding) every samp 
that falls in the region N by a single point £x is the introduction of distortion. 

Figure 7.3 shows an example of an eight-level quantization scheme. In this schem 
the eight regions are defined as Ry = (—00, a1], Rz = (a1, a2), ..., Rg = (a7, +00). TI 





Figure 7.3 Example of an 
eight-level quantization scheme 


'N is generally chosen to be a power of 2. If it is not, then the number of required bits would be [log, . 
where [x] denotes the smallest integer greater than or equal to x. To make the development easier, we alw 
assume that N is a power of 2. 
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representation point (or quantized value) in each region is denoted by £; and is shown in 
the figure. The quantization function Q is defined by 


Q (x) = x; forall x € Rj. (7.2.1) 


This function is also shown in the figure. - 

Depending on the measure of distortion employed, we can define the average distor- 
tion resulting from quantization. A popular measure of distortion, used widely in practice, 
is the squared error distortion defined as (x — £). In this expression, x is the sampled 
signal value and x is the quantized value, i.e., x = Q (x). If we are using the squared error 
distortion measure, then 


d(x, 8) = (x — Q (x)? =’, 


where X = x —X = x — Q (x). Since X is a random variable, so are X and X ; therefore, 
the average (mean squared error) distortion is given by 


D = E[d(X, X)] = E(X — Q (X))’. 


Example 7.2.1 
The source X (t) is a stationary Gaussian source with mean zero and power spectral density 


_ f2  |fl < 100Hz 
Se | 0 otherwise 


The source is sampled at the Nyquist rate and each sample is quantized using the eight-level 
quantizer which is shown in Figure 7.3 This figure has a} = —60, az = —40, a3 = —20, ag = 
0,as = 20,a5 = 40,a, = 60, and ĉ& = —70, x, = —50,%; = —30, ĉ& = —10, £; = 
10, £6 = 30, 7 = 50, and tg = 70. What is the resulting distortion and rate? 


Solution The sampling frequency is fs = 200 Hz. Each sample is a zero-mean Gaussian 
random variable with variance 


oe) 100 
0o? = E(X7) = Rx (eco = f Sx(f)df = f 2df = 400, 
=% —100 
where we have used Equations (5.2.18) and (5.2.21). Since each sample is quantized into eight 


levels, log, 8 = 3 bits are sufficient to represent (encode) the sample; therefore, the resulting 
rate is 


R = 3 fs = 600 bits/sec. 
To find the distortion, we have to evaluate E (X — È )? for each sample. Thus, we have 
z o0 
p=Ex-k = | @- wy fax, 
—00 


where fx (x) denotes the probability density function of the random variable X. From here, 
we have 


8 
D=}. he — Q (x)? fx(x)dx, 
i=l : 
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tel or equivalently, 


a 7 Gi 
i D= f (x = Ri)? fx (x) dx + om (x a RY fx(x)dx 
fe —oo i=2 “i-i 


+ T (x — 2s)’ fx (x)dx, (7.2.2 


where fx (x) is plage” fv, Substituting {ai} and EAH in this integral and evaluating th 
result with the Q-function table, we obtain D ~ 33.38. Note that if we were to use zero bii 
per source output, then the best strategy would be to set the reconstructed signal equal to zer 
In this case, we would have a distortion of D = E(X — 0)* = o? = 400. This quantizatio 
scheme and transmission of 3 bits per source output has enabled us to reduce the distortion t 























(| 33.38, which is a factor of 11.98 reduction or 10.78 dB. 
f "In the preceding example, we have chosen E(X — Q (X))*, which is the mea 
4 squared distortion, or quantization noise as the measure of performance. A more mear 
ingful measure of performance is a normalized version of the quantization noise, and it - 
normalized with respect to the power of the original signal. 
Definition 7.2.1. If the random variable X is quantized to Q (X), the signal-t 
quantization noise ratio (SQNR) is defined by 
SQNR = He) (7.2.: 
E(X — Q(X)?" a 
When dealing with signals, the quantization noise power is 
1 rf 
Py = Jim = | E (X(t) — Q (X(N) dt G; 
T=>œ T -7 i 
and the signal power is 
` T 
; 1 rf 
Px = lim = | _ E(X())dt. (7.2. 
x jim. = |, (X"(t))d ( 
Hence, the signal-to-quantization noise ratio is 
p , 
SONR = —. (7.2. 
Pz 
If X (t) is stationary, then this relation simplifies to Equation (7.2.3), where X is the randc 
variable representing X (t) at any point. 
Example 7.2.2 
Determine the SQNR for the quantization scheme given in Example 7.2.1. 














Solution From Example 7.2.1, we have Py = 400 and Pg; = D = 33.38. Therefore, 


Py 400 
Q P} 33.38 


+ r 
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Uniform Quantization. Uniform quantizers are the simplest examples of scalar 
quantizers. In a uniform quantizer, the entire real line is partitioned into N regions. All 
regions except 9, and Ry are of equal length, which is denoted by A. This means that for 
alll <i < N—2, we have aj+ı — a; = A. It is further assumed that the quantization levels 
are at a distance of a from the boundaries a), a2, . . . , ay—1. Figure 7.3 is an example of 
an eight-level uniform quantizer. In a uniform quantizer, the mean squared error distortion 
is given by 


Ds / ' æ- (a, — A/2))? fx(x)dx 


N-2 paytid 
+e [atid — A/D flora 


fay Yat DA 


4 f 7 (x — (a, + (N — DA + A/2))* fx (x)dx. (7.2.7) 
ayt(N-2)A . 3 


Thus, D is a function of two design parameters, namely, a; and A. In order to design the 
optimal uniform quantizer, we have to differentiate D with respect to these variables and 
find the values that minimize D. 

Minimization of distortion is generally a tedious task and is done mainly by numer- 
ical techniques. Table 7.1 gives the optimal quantization level spacing for a zero-mean 
unit-variance Gaussian random variable. The last column in the table gives the entropy 
after quantization which is discussed in Chapter 12. 


Nonuniform Quantization. If we relax the condition that the quantization re- 
gions (except for the first and the last one) be of equal length, then we are minimizing the 
distortion with less constraints; therefore, the resulting quantizer will perform better than 
a unifom quantizer with the same number of levels. Let us assume that we are interested 
in designing the optimal mean squared error quantizer with N levels of quantization with 
no other constraint on the regions. The average distortion will be given by 


+ 


al N-2` Gj 
D = f (x — £1)? fx (x)dx + y f "@ — S41)? fx (x)dx 
=o i=1 "^i , 


{o.0] 
+ f (x — ĉn)? fx (x)dx. . (7.2.8) 

an-1 
There exists a total of 2N — 1 variables in this expression (@),d2,...,@n—; and 
£1, £2, ..., ĉn) and the minimization of D is to be done with respect to these variables. 


Differentiating with respect to a; yields 


ð x A 
za, 7 Axla; — 4° — (ai — ĉi+1)?] = 0, (7.2.9) 
which results in 


Den © 
a; = zæ + Xi41). (7.2.10) 
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TABLE 7.1 OPTIMAL UNIFORM QUANTIZER FOR A GAUSSIAN SOURCE 


Number Output Output-level Mean Squared Entrop 
Levels Spacing Error A(x) 
N A D 

1 = 1.000 0.0 
2 1.596 0.3634 1.000 
3 1.224 0.1902 1.536 
4 0.9957 0.1188 1.904 
5 0.8430 0.08218 2.183 
6 0.7334 0.06065 2.409 
7 0.6508 0.04686 2.598 
8 0.5860 0.03744 2.761 
9 0.5338 0.03069 2.904 
10 0.4908 0.02568 3.032 
11 * 0.4546 0.02185 ` 3.148 
12 0.4238 0.01885 3.253 
13 -0.3972 0.01645 3.35 
14 0.3739 0.01450 3.44€ 
15 03534, 0.01289 l 3.524 
16 03352 59 0.01154 3.602 
17 0.3189 = 3 gy? 0.01040 3.67€ 
18 0.3042 | 0.009430 3.74€ 
19° 0.2909 “7 0.008594 3.81] 
20 ; 0.2788 0.007869 3.87¢ 
21 0.2678 0.007235 3.93: 
22 0.2576 0.006678 3.99( 
23 0.2482 0.006185 4.04: 
24 0.2396 0.005747 4.09; 
25 0.2315 0.005355, 4.14 
26 0.2240 0.005004 4.19% 
27 0.2171 0.004687 4.24: 
28 0.2105 0.004401 4.28: 
29 0.2044 0.004141 4321 
30 0.1987 0.003905 437 
31 0.1932 0.003688 4.411 
32 0.1881 0.003490 4.44 
33 0.1833 0.003308 Oo 44g 
34 0.1787 0.003141 = ASL 
35 0.1744 0.002986 4.561 


36 0.1703 0.002843 mw 4,59: 
From Max (1960); © IEEE. 


This result simply means that, in an optimal quantizer, the boundaries of the quantiz 
tion regions are the midpoints of the quantized values. Because quantization is done on 
minimum distance basis, each x value is quantized to the nearest {x; nae 

To determine the quantized values x;, we differentiate D withrespect to £; and defi 
ag = —œ and ay = +00. Thus, we obtain 
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= f ' 2(x — ĉi) fx (x)dx = 0, (7.2.11) 
ai—1 


which results in 


ai 
i= Jay Ex Gd (7.2.12) 
JE fxd 

Equation (7.2.12) shows that in an optimal quantizer, the quantized value (or representation 
point) for a region should be chosen to be the centroid of that region. Equations (7.2.10) 
and (7.2.12) give the necessary conditions for a scalar quantizer to be optimal; they are 
known as the Lloyd—Max conditions. The criteria for optimal quantization (the Lloyd~Max 
conditions) can then be summarized as follows: 


1. The boundaries of the quantization regions are the midpoints of the corresponding 
quantized values (nearest neighbor law). 


2. The quantized values are the centroids of the quantization regions. 


Although these rules are very simple, they do not result in analytical solutions to 
the optimal quantizer design. The usual method of designing the optimal quantizer is to 
start with a set of quantization regions and then, using the second criterion, to find the 
quantized values. Then, we design new quantization regions for the new quantized values, 
and alternate between the two steps until the distortion does not change much from one 
step to the next. Based on this method, we can design the optimal quantizer for various 
source. statistics. Table 7.2 shows the optimal nonuniform quantizers for various values 
of N for a zero-mean unit-variance Gaussian source. If, instead of this source, a general 
Gaussian source with mean m and variance o” is used, then the values of a; and £; read 
from Table 7.2 are replaced with m + oa; and m + oX;, respectively, and the value of the 
distortion D will be replaced by øo? D. 


Example 7.2.3 


How would the results of Example 7.2.1 change if, instead of the uniform quantizer shown in 
Figure 7.3, we used an optimal nonuniform quantizer with the same number of levels? 


Solution We can find the quantization regions and the quantized values from Table 7.2 with 
N = 8, and then use the fact that our source is an N (0, 400) source, i.e., m = 0 anda = 20. 
Therefore, all a; and £; values read from the table should be multiplied by 0 = 20 and the 
distortion has to be multiplied by 400. This gives us the values aj = —a7 = —34.96, a) = 
—dg = —21, a3 = —as = —10.012,-a4 = 0 and £; = —X%g = —43.04, ĉ&2 = —X) = —26.88, 
3 = —X%6 = —15.12, £4 = —X5 = —4.902 anda distortion of D = 13.816. The SQNR is 


400 
13.816 


which is 3.84 dB better than the SQNR of the uniform quantizer. a 


SQNR = = 28.95 ~ 14.62 dB, 
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TABLE 7.2 OPTIMAL NONUNIFORM QUANTIZER FOR A GAUSSIAN SOURCE 


Chapter 





2 


CAIYDUWNPWN 


10 


17 


19 


20 


21 


22 


0 

0.6120 

0, 0.9816 
0.3823, 1.244 


0, 0.6589, 1.447 
0.2803, 0.8744, 1.611 
0, 0.5006, 1.050, 1.748 


0.2218, 0.6812, 1.198, 
1.866 

0, 0.4047, 0.8339, 1.325, 
1.968 

0.1837, 0.5599, 0.9656, 
1.436, 2.059 

0, 0.3401, 0.6943, 1.081, 
1.534, 2.141 

0.1569, 0.4760, 0.8126, 
1.184, 1.623, 2.215 

0, 0.2935, 0.5959, 0.9181, 
1.277, 1.703, 2.282 
0.1369, 0.4143, 0.7030, 
1.013, 1.361, 1.776, 2.344 
0, 0.2582, 0.5224, 0.7996, 
1.099, 1.437, 1.844, 2.401 


0.1215, 0.3670, 0.6201, 
0.8875, 1.178, 1.508, 1.906, 
2.454 

0, 0.2306, 0.4653, 0.7091, 
0.9680, 1.251, 1.573, 1.964, 
2.504 

0.1092, 0.3294, 0.5551, 
0.7908, 1.042, 1.318, 1.634, 
2.018, 2.55 

0, 0.2083, 0.4197, 0.6375, 
0.8661, 1.111, 1.381, 1.690, 
2.068, 2.594 

0.09918, 0.2989, 0.5027, 
0.7137, 0.9361, 1.175, 
1.440, 1.743, 2.116, 2.635 
0, 0.1900, 0.3822, 0.5794, 
0.7844, 1.001, 1.235, 1.495, 
1.793, 2.160, 2.674 


4%; 
0 
0.7980 
0, 1.224 
0.4528, 1.510 


0, 0.7646, 1.724 

0.3177, 1.000, 1.894 

0, 0.5606, 1.188, 2.033 
0.2451, 0.7560, 1.344, 
2.152 

0, 0.4436, 0.9188, 1.476, 
2.255 

0.1996, 0.6099, 1.058, 
1.591, 2.345 

0, 0.3675, 0.7524, 1.179, 
1.693, 2.426 

0.1684, 0.5119, 0.8768, 
1.286, 1.783, 2.499 

0, 0.3138, 0.6383, 0.9870, 
1.381, 1.865, 2.565 
0.1457, 0.4413, 0.7505, 
1.086, 1.468, 1.939, 2.625 
0, 0.2739, 0.5548, 0.8512, 
1.175, 1.546, 2.007, 2.681 
0.1284, 0.3881, 0.6568, 
0.9424, 1.256, 1.618, 
2.069, 2.733 

0, 0.2430, 0.4909, 0.7493, 
1.026, 1.331, 1.685, 2.127, 
2.781 

0.1148, 0.3464, 0.5843, 
0.8339, 1.102, 1.400, 1.746, 
2.181, 2.826 

0, 0.2184, 0.4404, 0.6698, 
0.9117, 1.173, 1.464, 1.803, 
2.232, 2.869 

0.1038, 0.3128, 0.5265, 
0.7486, 0.9837, 1.239, 
1.524, 1.857, 2.279, 2.908 
0, 0.1984, 0.3994, 0.6059, 
0.8215, 1.051, 1.300, 1.579, 
1.908, 2.324, 2.946 
0.09469, 0.2852, 0.4793, 
0.6795, 0.8893, 1.113, 
1.357, 1.632, 1.955, 2.366, 
2.982 


1 
0.3634 
0.1902 
0.1175 
0.07994 
0.05798 
0.04400 
0.03454 
0.02785 
0.02293 
0.01922 
0.01634 
0.01406 
0.01223 
0.01073 


0.009497 


0.008463 


0.007589 


0.006844 


0.006203 


0.005648 


a 


0.005165 


+a; 


3.849 


3.928 


(continued overlec 
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TABLE 7.2 (CONTINUED) 
N +a; +3; D H(X) 
23 0.09085, 0.2736, 0.4594, 0, 0.1817, 0.3654, 0.5534, 0.004741 4.268 
0.6507, 0.8504, 1.062, 0.7481, 0.9527, 1.172, 
1.291, 1.546, 1.841, 2.203, 1.411, 1.682, 2.000, 2.406, 
2.711 3.016 
24 0, 0.1746, 0.3510, 0.5312, 0.08708, 0.2621, 0.4399, 0.004367 4.327 
0.7173, 0.9122, 1.119, 1.344, 0.6224, 0.8122, 1.012, ` 
1.595, 1.885, 2.243, 2.746 1.227, 1.462, 1.728, 2.042, 
2.444, 3.048 
25 0.08381, 0.2522, 0.4231, 0, 0.1676, 0.3368, 0.5093, 0.004036 4.384 
0.5982, 0.7797, 0.9702, 0.6870, 0.8723, 1.068, 
1.173, 1.394, 1.641, 1.927, 1.279, 1.510, 1.772, 2.083, 
2.281, 2.779 2.480, 3.079 
26 0, 0.1616, 0.3245, 0.4905, 0.08060, 0.2425, 0.4066, 0.003741 4.439 
0.6610, 0.8383, 1.025, 0.5743, 0.7477, 0.9289, 
1.224, 1.442, 1.685, 1.968, 1.121, 1.328, 1.556, 1.814, 
2.318, 2.811 2.121, 2.514, 3.109 
27 0.07779, 0.2340, 0.3921, 0, 0.1556, 0.3124, 0.4719, 0.003477 4.491 
0.5587, 0.7202, 0.8936, 0.6354, 0.8049, 0.9824, 
1.077, 1.273, 1.487, 1.727, 1.171, 1.374, 1.599, 1.854, 
2.006, 2.352, 2.842 2.158, 2.547, 3.137 
28 0, 0.1503, 0.3018, 0.04556, 0.07502, 0.2256, 0.3780, 0.003240 4.542 


0.6132, 0.7760, 0.9460, 
1.126, 1.319, 1.529, 1.766, 
2.042, 2.385, 2.871 


0.5333, 0.6930, 0.8589, 
1.033, 1.118, 1.419, 1.640, 
1.892, 2.193, 2.578, 3.164 








From Max (1960); © IEEE. 


7.2.2 Vector Quantization 


In scalar quantization, each output of the discrete-time source (which is usually the.result 
of sampling, of a continuous-time source) is quantized separately and then encoded. For 
example, if we are using a four-level scalar quantizer and encoding each level into two 
bits, we are using two bits per each source output. This quantization scheme is shown in 
Figure 7.4. _ ; 

Now if we consider two samples of the source at each time, and we interpret these 
two samples as a point in a plane, the. quantizer partitions the entire plane into 16 quanti- 
zation regions, as show in Figure 7.5. We can see that the regions in the two-dimensional 
space are all of rectangular shape. If we allow 16 regions of any shape in the two- 
dimensional space, we are capable of obtaining better results. This means that we are 
quantizing two source outputs at astime by using 16 regions, which is equivalent to four 
bits per two source outputs or two bits per each source output. Therefore, the number of 
bits per source output for quantizing two samples at a time is equal to the number of bits 
per source output obtained in the scalar case. Because we are relaxing the requirement of 
having rectangular regions, the performance may improve. Now, if we take three samples 
at a time and quantize the entire three-dimensional space into 64 regions, we will have 
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Figure 7.4 A four-level scalar 
quantizer. 





Figure 7.5 A scalar four-level quantization applied to two samples. 


even less distortion with the same number of bits per source output. The idea of vect: 
quantization is to take blocks of source outputs of length n, and design the quantizer in tt 
n-dimensional Euclidean space, rather than doing the quantization based on single sampk 
in a one-dimensional space. 

Let us assume that the quantization regions in the n-dimensional space are denote 
by Ri, 1 < i < K. These K regions partition the n-dimensional space. Each block « 
source output of length n is denoted by the n-dimensional vector x € R”; if x € Ri, it 
quantized to Q (x) = X;. Figure 7.6 shows this quantization scheme for n = 2. Now, sin 
there are a total of K quantized values, log K bits are enough to represent these value 
This means that we require log K bits per n source outputs, or the rate of the source code 


log K 
R= a bits/source output. (7.2.1 
n 


The optimal vector quantizer of dimension n and number of levels K chooses tl 
regions R;’s and the quantized values X;’s such that the resulting distortion is minimize 
Applying the same procedure that we used for the case of scalar quantization, we obta 
the following criteria for an optimal vector quantizer design: 


1. Region J; is the set of all points in the n-dimensional space that are closer to x; th 
any other X;, for all j # i; i.e., 


Ri = {x € R” : ||x— Kl] < [k-—Kll, Yj # i} 
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. Figure 7.6 Vector quantization in two dimensions. 


2. &; is the centroid of the region 9y; i.e., 


= Fax em |g: | Om 
Xi = BOXER) a x fx(x) X. 


A practical approach to designing optimal vector quantizers is on the basis of the 
same approach employed in designing optimal scalar quantizers. Starting from a given set 
of quantization regions, we derive the optimal quantized vectors for these regions by using 
criterion 2. Then we repartition the space using the first criterion, and go back and forth 
until the changes in distortion are negligible. 

Vector quantization has found widespread applications in speech and image coding; 
numerous algorithms for reducing its computational complexity have been proposed. 


7.3 ENCODING 


In the encoding process, a sequente of bits are assigned to different quantization values. 
Since there are a total of N = 2” quantization levels, v bits are sufficient for the encoding 
process. In this way, we have v bits corresponding to each sample; since the sampling rate 
is fs samples/sec, we will have a bit rate of R = vf, bits/sec. 

The assignment of bits to quantization levels can be done in a variety of ways. In 
scalar quantization, a natural way of encoding is to assign the values of 0 to N — 1 to 
different quantization levels starting from the lowest level to the highest level in order of 
increasing level value. Then we can assign the binary expansion of the numbers 0 to N — 1 
to these levels. Thus, v zeros are assigned to the lowest quantization level, 0...01 to the 


v-1 
second lowest quantization level, 0...0 10 to the next level, ...and 1... 1 to the highest 
“aS —’ ~ 


v—2 v 
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TABLE 7.3 NBC AND GRAY CODES FOR A 16-LEVEL QUANTIZATION 


Quantization Level Level Order NBC Code Gray Code 











o [o | m] w 
o T 
e m | 
— io | 
| X43 1100 1001 
rs 0 | 
e T 

X16 15 1111 1011 


quantization level. This type of encoding is called natural binary coding or NBC for short 
Another approach to coding is to encode the quantized levels in a way that adjacent levels 
differ only in one bit. This type of coding is called Gray coding. 

Table 7.3 gives an example of NBC and Gray coding for a quantizer with N = 1€ 
levels. 


7.4 WAVEFORM CODING 


Waveform coding schemes are designed to reproduce the waveform output of the source a 
the destination with as little distortion as possible. In these techniques, no attention is paic 
to the mechanism that produces the waveform; all attempts are directed at reproducing the 
source output at the destination with high fidelity. The structure of the source plays no rok 
in the design of waveform coders and only properties of the waveform affect the design 
Thus, waveform coders are robust and can be used with a variety of sources as long a: 
the waveforms produced by the sources have certain similarities. In this section, we stud: 
some basic waveform coding methods that are widely applied to a variety of sources. 
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7.4.1 Pulse Code Modulation 


Pulse code modulation (PCM) is the simplest and oldest waveform coding scheme. A pulse 
code modulator consists of three basic sections: a sampler, a quantizer and an encoder. A 
functional block diagram of a PCM system is shown in Figure 7.7. In PCM, we make the 
following assumptions: 


1. The waveform (signal) is bandlimited with a maximum frequency of W. Therefore, 
it can be fully reconstructed from samples taken at a rate of fs = 2W or higher. 
2. The signal is of finite amplitude. In other words, there exists a maximum amplitude 
< Xmax Such that for all t, we have |x(t)| < Xmax. 


3. The quantization is done with a large number of quantization levels N, which is a 
power of 2 (N = 2”). 


The waveform entering the sampler is a bandlimited waveform with the bandwidth W. 
Usually, there exists a filter with bandwidth W prior to the sampler to prevent any com- 
ponents beyond W from entering the sampler. This filter is called the presampling filter. 
The sampling is done at a rate higher than the Nyquist rate; this allows for some guard 
band. The sampled values then enter a scalar quantizer. The quantizer is either a uniform 
quantizer, which results in a uniform PCM system, or a nonuniform quantizer. The choice 
of the quantizer is based on the characteristics of the source output. The output of the quan- 
tizer is then encoded into a binary sequence of length v, where N = 2” is the number of 
quantization levels. i 


Uniform PCM. In uniform PCM, we assume that the quantizer is a uniform 
quantizer. Since the range of the input samples is [—Xmax, +Xmax] and the number of quan- 
tization levels is N, the length of each quantization region is given by 


2Xmax _ Xmax 
N = Qv-1° 
The quantized values in uniform PCM are chosen to be the midpoints of the quantization 
regions; therefore, the error x = x — Q (x) is a random variable taking values in the 
interval (~ê, + $]. In ordinary PCM applications, the number of levels (N) is usually high 
and the range of variations of the input signal (amplitude variations Xmax) is small. This 
means that the length of each quantization region (A) is small. Under these assumptions, 
in each quantization region, the error X = X — Q (X) can be approximated by a uniformly 








A= (7.4.1) 


distributed random variable on (-4, +4]. In other words, z J 
1 Aoz TCA Ve 
fH ays 2547 2 
0 otherwise 


x(t {Xn} x ...0110... 


' Figure 7.7 Block diagram of a PCM system. 
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The distortion introduced by quantization (quantization noise) is therefore 








5 +4 1 Ae y2 . 
E[X’] = ~i di = = T = n 7.4.2 
[x] f A SDB BN? 3 cae’ CAA 


where v is the number of bits/source sample and we have employed Equation (7.4.1). The 
signal-to-quantization noise ratio then becomes 


Py 3 x N* Py 3 x 4" Px 








i SQNR = = = —— = =z `; (7.4.3) 
K X2 Kanak Xmax ` 
| } where Px is the power in each sample. In the case where X (t) is a wide-sense stationary 
| Pi process, Py can be found using any of the following relations: ( 
Px = Rx(T),_9 
2 . 
=| saunas 
—0o 
co 
-f x? fy(x) dx. 
—0o 


Note that since Xmax is the maximum possible value for X, we always have Py = E[X?] < 
ax This means that = p = <1 (usually 5%- < 1); hence, 3N? = 3 x 4” is an upperbounė 
to phe SQNR in cee PCM. This also means that SQNR in uniform PCM deteriorates 
as the dynamic range of the source increases because an increase in the dynamic range ol 
the source results in a decrease in —*- x 
Expressing SQNR in decibels “we obtain 





x2 
Xmax 


Px 

PONR ~ 10 logiy + 6v + 4.8. (7.4.4 
dB 7 

We can see that each extra bit (increase in v by one) increases the SQNR by 6 dB. This i: 


a very useful strategy for estimating how many extra bits are required: to achieve a desirec 
SQNR. 


Example 7.4.1 
Whatis the resulting SQNR for a signal uniformly distributed on [—1, 1], when Mnu orn PCN 
with 256 levels is employed? 














1 1 
Solution Wehave Py = Si ye ax a Therefore, using Xmax = 1 and v = log 256 = 8 





we have 
SQNR = 3 x 4” x Py = 4” = 65536 ~ 48.16 dB. (7.4.5 
á I 





| The issue of bandwidth requirements of pulse transmission systems, of which PCN 
is an example, is dealt with in detail in Chapter 10. In this chapter, we briefly discuss som: 
results concerning the bandwidth requirements of a PCM system. If a signal has a band 
width of W, then the minimum number of samples for perfect reconstruction-of the signa 


t 


t 
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is given by the sampling theorem, and itis equal to 2W samples/sec. If some guard band 
is required, then the number of samples per second is fs, which is more than 2W. For each 
sample, v bits are used; therefore, a total of vf, bits/sec are required for transmission of the 
PCM signal. In the case of sampling at the Nyquist rate, this is equal to 2vW bits/sec. The 
minimum bandwidth requirement for binary transmission of R bits/sec (or, more precisely, 
R pulses/sec) is R, (See Chapter 10.)? Therefore, the minimum bandwidth requirement of 
a PCM system is ` 
vfs 


pi 
which, in the case of sampling at the Nyquist rate, gives the absolute minimum bandwidth 
requirement for pulse transmission as 


BWreq = (7.4.6) 


BWreq = VW. (7.4.7) 


This means that a PCM system expands the bandwidth of the original signal by a factor of 
at least v. 


Nonuniform PCM. As long as the statistics of the input signal are close to the 
uniform distribution, uniform PCM works fine. However, in coding of certain signals such 
as speech, the input distribution is far from uniform. For a speech waveform, in particular, 
there exists a higher probability for smaller amplitudes and a lower probability for larger 
amplitudes. Therefore, it makes sense to design a quantizer with more quantization regions 
at lower amplitudes and fewer quantization regions at larger amplitudes. The resulting 
quantizer will be a non-uniform quantizer that has quantization regions of various sizes. 

The usual method for performing nonuniform quantization? is to first pass the 
samples through a nonlinear element that compresses the large amplitudes (reduces the 
dynamic range of the signal) and then performs a uniform quantization on the output. At 
the receiving end, the inverse (expansion) of this nonlinear operation is applied to obtain the 
sampled value. This technique is called companding (compressing--expanding). A block 
diagram of this system is shown in Figure 7.8. 

There are two types of companders that are widely used for speech coding. The 
-law compander, used in the United States and Canada, employs the logarithmic function 
at the transmitting side, where |x| < 1: 


_ log(l + u Ix) 
Eœ = ogl +m 


The parameter u controls the amount of compression and expansion. The standard PCM 
system in the United States and Canada employs a compressor with u = 255 followed 
by a uniform quantizer with 8 bits/sample. Use of a compander in this system improves 
the performance of the system by about 24 dB. Figure 7.9 illustrates the u-law compander 
characteristics for u = 0, 5, and 255. 


sgn(x). (7.4.8) 


2A more practical bandwidth requirement is 2 where 1 <a < 2. 
Sometimes, the term nonlinear quantization is used. This is misleading, because all quantization 
schemes, uniform or nonuniform, are nonlinear. 
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{Xn} H. Ea 
Compressor Uniform Expander | Reconstruction | 
gex) Taea PCM sa Deer | Decode? g(x) | fter | 


Figure 7.8 Block diagram of a nonuniform PCM system. 


1 . 


a(x) 
© 
WN 


Figure 7.9 A graphof -law compander characteristics. 


The second widely used logarithmic compressor is the A-law compander. The char 
acteristics of this compander are given by 


1 + log A|x| e 


g= log A 


gn(x), (7.4.9 
where A is chosen to be 87.56. The performance of this compander is comparable to thi 
performance of the u-law compander. Figure 7.10 illustrates the characteristics of thi 
compander for A = 1, 2, and 87.56. 


7.4.2 Differential Pulse Code Modulation 


In a PCM system, after sampling the information signal, each sample is quantized inde 
pendently using a scalar quantizer. This means that previous sample values have no effec 
on the quantization of the new samples. However, when a bandlimited random process i 
sampled at the Nyquist rate or faster, the sampled values are usually correlated randon 
variables. The exception is the case when the spectrum of the process is flat within it 
bandwidth. This means that the previous samples give some information about the nex 
sample; thus, this information can be employed to improve the performance of the PCN 


t é 
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Figure 7.10 A graph of A-law compander characteristics. 


system. For instance, if the previous sample values were small, and there is a high proba- 
bility that the next sample value will be small as well, then it is not necessary to quantize a 
wide range of values to achieve a good performance. 

In the simplest form of differential pulse code modulation (DPCM), the difference 
between two adjacent samples is quantized. Because two adjacent samples are highly cor- 
related, their difference has small variations; therefore, to achieve a certain level of perfor- 
mance, fewer levels (and therefore fewer bits) are required to quantize it. This means that 
DPCM can achieve performance levels at lower bit rates than PCM. 

Figure 7.11 shows a block diagram of this simple DPCM scheme. As seen in the 
figure, the input to the quantizer is not simply Xn — Xn-1 but rather X, — Vix We will 
see that Vis is closely related to X,-1, and this choice has an advantage because the 
accumulation of quantization noise is prevented. The input to the quantizer Y„ is quantized 
by a scalar quantizer (uniform or nonuniform) to produce Y,,. Using the relations 





Yn = Xn- Êi (7.4.10) 
Ê, i, x 
Q 
Ê, 
T=1 





Figure 7.11 A simple DPCM encoder and decoder. 
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and R 
Y; = Y, + Yp (7.4.11) 


we obtain the quantization error between the input and the output of the quantizer as 


y Yn — Yn = Yn — (Xn — Ê) 


=f,- Xn + oer, 
=i = Xy (1.4.12 
At the receiving end, we have 
Xn = În + Ên. (7.4.13. 


Comparing Equation (7.4.11) with Equation (7.4.13) we see that Y! and Ên satisfy the 
same difference equation with the same excitation function (È). Therefore, if the initia’ 
conditions of Y! and Ê n are chosen to be the same, they will be equal. For instance, if we 
let ¥’, = X_1 = 0, then all n will have Ê! = X,. Substituting this in Equation (7.4.12) 
we obtain 

an fee ee oe (7.4.14 


This shows that the quantization error between X,, and its reproduction X, is the same a: 
the quantization error between the input and the output of the quantizer. However, the range 
of variations of Y,, is usually much smaller than that of X,,; therefore, Y„ can be quantizec 
with fewer bits. 


Example 7.4.2 
Speech signal is bandlimited to 3 kHz and sampled at the rate of 8 kHz. To achieve the sam« 
quality of distortion PCM requires 8 bits/sample and DPCM requires 4 bits/sample. Determin 
the bit rates required to transmit the PCM and DPCM encoded signals 


Solution For PCM, we have 
R = vfs = 8 x 8000 = 64,000 bps 


and for DPCM 
R = vfs = 4 x 8000 = 32,000 bps. 1 


7.4.3 Delta Modulation 


Delta modulation (DM) is a simplified version of the DPCM system shown in Figure 7.11 
In delta modulation, the quantizer is a one-bit (two-level) quantizer with magnitudes +A 
A block diagram of a DM system is shown in Figure 7.12. The same analysis that wa 
applied to the simple DPCM system is valid here. 

In DM only one bit per sample is employed, so the quantization noise will be hig] 
unless the dynamic range of Y,, is very low. This, in turn, means that X,, and X,_, must havı 
a very high correlation coefficient. To have a high correlation between X„ and X,—1, W 
have to sample at rates much higher than the Nyquist rate. Therefore, in DM, the samplin; 
rate is usually much higher than the Nyquist rate, but since the number of bits per sampl 
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Figure 7.12 Delta modulation. 


is only one, the total number of bits per second required to transmit a waveform may be 
lower than that of a PCM system. 

A major advantage of DM is the very simple structure of the system. At the receiving 
end, we have the following relation for the reconstruction of Xn: 


n- 1 =Ê. (7.4.15) 


Solving this equation for È n, and assuming zero initial conditions, we obtain 
n 
R=) Ê. (7.4.16) 
i=0 


This means that to obtain X n», we only have to accumulate the values of Ê,. If the sam- 
pled values are represented by impulses, the accumulator will be a simple integrator. This 
simplifies the block diagram of a DM system, as shown in Figure 7.13. 

The step size A is a very important parameter in designing a delta modulator system. 
Large values of A cause the modulator to follow rapid changes in the input signal; but 
at the samestime, they cause excessive quantization noise when the input changes slowly. 
This case is shown in Figure 7.14. For large A, when the input varies slowly, a large quan- 
tization noise occurs; this is known as granular noise. The case of a too small A is shown 
in Figure 7.15. In this case, we have a problem with rapid changes in the input. When the 


“> 


A 
Xn 
Aceumulato-— > 





\ _ stg 
Figure 7.13 Delta modulation with integrators. l 
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t Figure 7.14 Large A and granular noise. 


Figure 7.15 Small A and slope overload 
t distortion. 


input changes rapidly (high-input slope), it takes a rather long time for the output to follo 
the input, and an excessive quantization noise is caused in this period. This type of di 
tortion, which is caused by the high slope of the input waveform, is called slope overloc 
distortion. 


Adaptive Delta Modulation. We have seen that a step size that is too lar; 
causes granular noise, and a step size too small results in slope overload distortion. Th 
means that a good choice for A is a “medium” value; but in some cases, the performan 
of the best medium value (i.e., the one minimizing the mean squared distortion) is n 
satisfactory. An approach that works well in these cases is to change the step size accordii 
to changes in the input. If the input tends to change rapidly, the step size must be large 
that the output can follow the input quickly and no slope overload distortion results. Wh 
the input is more or less flat (slowly varying), the step size is changed to a small value 
prevent granular noise. Such changes in the step size are shown in Figure 7.16. 

To adaptively change the step size, we have to design a mechanism for recognizi 
large and small input slopes. If the slope of the input is small, the output of the quantiz 
Ê alternates between A and —A, as shown in Figure 7.16. This is the case where granu 
noise is the main source of noise, and we have to decrease the step size. However, in t 
case of slope overload, the output cannot follow the input rapidly and the output of t 
quantizer will be a succession of +A’s or —A’s. We can see that the sign of two successi 
Y,’s isa good criterion for changing the step size. If the two successive outputs have | 
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Figure 7.16 Performance of adaptive delta 
t modulation. 


same sign, the step size should be increased; if they are of opposite signs, it should be 
decreased. . 
A particularly simple rule to change the step size is given by 


An = An Ke, (7.4.17) 


where ép is the output of the quantizer before being scaled by the step size and K is some 
constant larger than 1. It has been verified experimentally that in the 20-60 kilobits/sec 
range, with a choice of K = 1.5, the performance of adaptive delta modulation systems is 
5-10 dB better than the performance of delta modulation when applied to speech sources. 


| 7.5 ANALYSIS-SYNTHESIS TECHNIQUES 


In contrast to waveform coding, analysis—synthesis techniques are methods that are based 
on a model for the mechanism that produces the waveform. The parameters of the model 
that are extracted from the source waveform are quantized, encoded, and transmitted to the 
receiving end. At the receiving end, based on the received information, the same model is 
synthesized andused to generate an output similar to the original waveform. These systems 
are mainly used for speech coding. In this section, we will briefly treat a system known as 
linear predictive coding (LPC). 

Speech-is produced as a result of excitation of the vocal tract by the vocal cords. This 
mechanism can be modeled as a time-varying filter (the vocal tract) excited by a signal 
generator. The vocal tract is a combination of the throat, the mouth, the tongue, the lips, 
and the nose. They change shape during generation of speech; therefore, the vocal tract is 
modeled as a ‘time-varying system. The properties of the excitation signal highly depend 
on the type of speech sounds; they can be either voiced or unvoiced. For voiced speech, 
the excitation can be modeled as a periodic sequence of impulses at a frequency fo, the 
value of which depends on the speaker. The reciprocal + is called the pitch period. For 
unvoiced speech, the excitation is well modeled as a white noise. This model is shown in 
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Figure7.17 Model for speech generation mechanism. 
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Figure 7.17. The vocal tract filter is usually modeled as an all-pole filter described by thy 
difference equation 


P 
X, = ys a;Xn-i + Gwn, (7.5.1 
i=] 


where w, denotes the input sequence (white noise or impulses), G is a gain parameter 
{ai } are the filter coefficients, and p is the number of poles of the filter. The process w, 
which represents the part of X, that is not contained in the previous p samples, is callec 
the innovation process. 

Speech signals are known to be stationary for short periods of time, such as 20- 
30 msec. This characteristic behavior follows from the observation that the vocal trac 
cannot change instantaneously. Hence, over 20-30 msec intervals, the all-pole filter coef 
ficients may be assumed to be fixed. At the encoder, we observe a 20-30 msec record o 
speech from which we estimate the model parameters {a;}, the type of excitation signa 
(white noise or impulse), the pitch period A if the speech is voiced, and the gain parame 
ter G. 

To elaborate on this process, suppose that the speech signal is filtered to 3 kHz anı 
sampled at a rate of 8000 samples/sec. The signal samples are subdivided into blocks o 
160 samples, corresponding to 20 msec intervals. Let {x,, 1 < n < 160} be the sequenc 
of samples for a block. The encoder must measure the model parameters to be transmitte: 
to the receiver. 

Linear prediction is used to determine the filter coefficients at the encoder. A linea 
predictor of order p is an all-zero digital filter with input {x,,} and output 


P 
n=) atxak forl<sa<N, (7.5.2 
k=1 


where we have assumed that x,, = 0 is outside the interval of interest. Figure 7.18 illustrate 
the functional block diagram for the prediction process. The difference between the actuz 
speech sample x, and the predicted value x, constitutes the prediction error eņ, i.e., 


€n = Xn — Xn 


P 
— 5 AkXn—k. (7.5.3 
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Figure 7.18 Functional block diagram of linear prediction. 


In order to extract as much information as possible from the previous values of X,,, we 
choose the coefficients {a;} so that the average of the squared error terms, i.e., 


1 N 
eae 2 
Sp = N de 
1X P 2 
=F 5D (s -5 ams) f (7.5.4) 
n=1 k=1 


is minimized. Differentiating @, with respect to each of the prediction filter coefficients 
{a;} and setting the derivative to zero, we obtain a set of linear equations for the filter 
coefficients, i.e., 


N N p 
1 1 ‘ 
ay 2 XnXn-i = 7 2 a akXn—iXn-k forl <i<p.- (7.5.5) 


Since we have assumed that outside the stationary interval, 1 < n < N, we have x, = 0, 
we can write the preceding relation as 


1 2 ER e 
N D XnXn-i = N D YO an Xn—i%n—k 
n=—00 n=—00 k=1 
P 1 co 
— ya N D set l (7.5.6) 
k=1 n=—00 
Now if we define 
O ee ; 
Ri = N XnXn-is . (7.5.7) 
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we can write Equation (7.5.6) as 


P 
Ê; = Sa Rix for 1 <i < p. (7.5.8 
k=1 


These equations are called the Yule-Walker equations. We can express them in the matri; 
form 


r=Ra, (7.5.9 


where a is the vector of the linear predictor coefficients, Risa p x p matrix whose (i, j)tl 
element is R;—j, and r is a vector whose components are R;’s. It can be easily verified fron 
the definition of R; that 


A 


R; = Ri: (7.5.10 


therefore, the matrix Risa symmetric matrix. Also, it is obvious that all elements of Í 
that are on a parallel line to the diagonal elements are equal. Such a matrix is called ; 
Toeplitz matrix, and efficient recursive algorithms exist for finding its inverse and, thus 
solving Equation (7.5.9) for the vector of predictor coefficients. One such algorithm is th 
well-known Levinson—Durbin algorithm. Refer to the references at the end of this chapte 
for the details of this algorithm. 

For the optimal choice of the predictor coefficients, the squared error term can b 
shown to be 


P 
gmin = Ry — D Ry. (7.5.11 


Lae p i 
Ep” =a 5 E = Sas] 
n=1 k=] 
1 
= C D we. (7.5.12 
n=1 F 


If we normalize the excitation sequence {wp} such that x i w? = 1, we*obtain th 
value of the gain parameter as 


G = J85", (7.5.13 


The estimation of the type of excitation (impulsive or noise), as well as the estimat 
of the pitch period A (when the excitation consists of impulses), may be accomplished b 
various algorithms. One simple approach is to transform the speech data into the frequenc 
domain and look for sharp peaks in the signal spectrum. If the spectrum exhibits peaks : 
some fundamental frequency fo, the excitation is taken to be a periodic impulse train wit 
the period T If the spectrum of the speech samples exhibits no sharp peaks, the excitatio 
is taken as white noise. 
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The prediction filter coefficients, gain, voiced—unvoiced information, and pitch 4 
are quantized and transmitted to the receiver for each block of sampled speech. The speech 
signal is synthesized from these parameters using the system model shown in Figure 7.17. 
Typically, the voiced—unvoiced information requires one bit, the pitch frequency is rep- 
resented by 6 bits, the gain parameter can be represented by 5 bits using logarithmic 
companding, and the prediction coefficients require 8—10 bits/coefficient. Based on LPC, 
speech can be compressed to bit rates as low as 2400 bits/sec. We could alternatively use 
vector quantization when quantizing the LPC parameters. This would further reduce the 
bit rate. In contrast, PCM applied to speech has a bit rate of 64,000 bits/sec. 

LPC is widely used in speech coding to reduce the bandwidth. By vector quantizing 
the LPC parameters, good quality speech is achieved at bit rates below 10,000 bits/sec in 
mobile (cellular) telephone systems. 


7.6 DIGITAL AUDIO TRANSMISSION AND DIGITAL AUDIO RECORDING 





Audio signals constitute a large part of our daily communications. Today, thousands of 
radio stations broadcast audio signals in analog form. The quality of voice signal broad- 
casting is generally acceptable, as long as the voice signal is intelligible. On the other 
hand, the quality of music signals that are broadcast via AM radio is relatively low fidelity 
because the bandwidth of the transmitted signal is restricted through regulation (by the 
Federal Communications Commission in the United States). The FM radio broadcast of 
analog signals provides higher fidelity by using a significantly larger channel bandwidth 
for signal transmission. Commercial radio broadcasting of audio signals in digital form has 
already begun with the advent of satellite radio systems. 

In the transmission of audio signals on telephone channels, the conversion from 
analog-to-digital transmission, which has been taking place over the past several decades, 
is now nearly complete. We will describe some of the current developments in the digital 
encoding of audio signals for telephone transmission. 

The entertainment industry has experienced the most dramatic changes and benefits 
in the conversion of analog audio signals to digital form. The development of the compact 
disc (CD) player and the digital audio tape recorder have rendered the previous analog 
recording systems technically obsolete. We shall use the CD player as a case study of the 
sophisticated source encoding/decoding and channel encoding/decoding methods that have 
been developed over the past few years for digital audio systems. 


7.6.1 Digital Audio in Telephone Transmission Systems 


Nearly all of the transmission of speech signals over telephone channels is currently digital. 
The encoding of speech signals for transmission over telephone channels has been a topic 
of intense research for over 50 years. A wide variety of methods for speech source encoding 
have been developed over the years; many of these methods are in use today. 

The general configuration for a speech signal encoder is shown in Figure 7.19. 
Because the frequency content of speech signals is limited to below 3200 Hz, the speech 
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Figure 7.19 A general speech encoder. 


signal is first passed through an antialiasing lowpass filter and then sampled. To ensure 
that aliasing is negligible, a sampling rate of 8000 Hz or higher is typically selected. The 
analog samples are then quantized and represented in digital form for transmission over 
telephone channels. 

PCM and DPCM are widely used waveform encoding methods for digital speech 
transmission. Logarithmic u = 255 compression, given by Equation (7.4.8), is generally 
used for achieving nonuniform quantization. The typical bit rate for PCM is 64, 000 bps, 
while the typical bit rate for DPCM is 32,000 bps. 

PCM and DPCM encoding and decoding are generally performed in a telephone 
central office, where telephone lines from subscribers in a common geographical area 
are connected to the telephone transmission system. The PCM or DPCM encoded speech 
signals are transmitted from one telephone central office to another in digital form over 
so-called trunk lines, which are capable of carrying the digitized speech signals of many 
subscribers. The method for simultaneous transmission of several signals over a common 
communication channel is called multiplexing. In the case of PCM and DPCM transmis- 
sion, the signals from different subscribers are multiplexed in time; hence, we have the 
term time-division multiplexing (TDM). In TDM, a given time interval Ty is selected 
as a frame. Each frame is subdivided into N subintervals of duration T;/N, where N 
corresponds to the number of users who will use the common communication channel. 
Then, each subscriber who wishes to use the channel for transmission is assigned a subin- 
terval within each frame. In PCM, each user transmits one 8-bit sample in each 
subinterval. l 

In digital speech transmission over telephone lines via PCM, there is a standard TDM 
hierarchy that has been established for accommodating multiple subscribers. In the first 
level of the TDM hierarchy, 24 digital subscriber signals are time-division multiplexed into 
a single high-speed data stream of 1.544 Mbps (24 x 64 kbps plus a few additional bits for 
control purposes). The resulting combined TDM signal is usually called a DS-1 channel. 
In the second level of TDM, four DS-1 channels are multiplexed into a DS-2 channel, 
each having the bit rate of 6.312 Mbps. In a third level of hierarchy, seven DS-2 channels 
are combined via TDM to produce a DS-3 channel, which has a bit rate of 44.736 Mbps. 
Beyond DS-3, there are two more levels of TDM hierarchy. Figure 7.20 illustrates the TDM 
hierarchy for the North American telephone system. 
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Figure 7.20 The TDM hierarchy forthe North American telephone system. 


In mobile cellular radio systems for transmission of speech signals, the available bit 
rate per user is small and cannot support the high bit rates required by waveform encoding 
methods, such as PCM and DPCM. For this application, the analysis-synthesis method 
based on the LPC, as described in Section 7.5, is used to estimate the set of model param- 
eters from short segments of the speech signal. The speech model parameters are then 
transmitted over the channel using vector quantization. Thus, a bit rate in the range of 
4800-9600 bps is achieved with LPC. 

In mobile cellular communication systems, the base station in each cell serves as 
the interface to the terrestrial telephone system. LPC speech compression is only required 
for the radio transmission between the mobile subscriber and the base station in any cell. 
At the base station interface, the LPC-encoded speech is converted to PCM or DPCM for 
transmission over the terrestrial telephone system at a bit rate of 64,000 bps or 32,000 bps, 
respectively. Hence, we note that a speech signal transmitted from a mobile subscriber to 
a fixed subscriber will undergo two different types of encoding; however, a speech signal 
communication between two mobiles serviced by different base stations, connected via the 
terrestrial telephone system, will undergo four encoding and decoding operations. 


7.6.2 Digital Audio Recording 


Audio recording became a reality with the invention of the phonograph during the second 
half of the nineteenth century. The phonograph had a lifetime of approximately 100 years, 
before it was supplanted by the CD that was introduced in 1982. During the 100-year 
period, we witnessed the introduction of a wide variety of records, the most popular of 
which proved to be the long-playing (LP) record that was introduced in 1948. LP records 
provide relatively high-quality analog audio recording. 

In spite of their wide acceptance and popularity, analog audio recordings have a 
number of limitations, including a limited dynamic range (typically about 70 dB) and a 
relatively low signal-to-noise ratio (typically about 60 dB). By comparison, the dynamic 
range of orchestral music is in the range of 100-120 dB. This means that if we record the 
music in analog form, at low music levels, noise will be audible and, if we wish to prevent 
this noise, saturation will occur at high music levels. 
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TABLE 7.4 COMPARISON OF AN LP RECORD AND A CD SYSTEM 


Specification/Feature LP Record CD System 


Frequency response 30 Hz-20 kHZ 20 Hz-20 kHz 
+3 dB +0.5/—1 dB 
Dynamic range 70 dB >90 dB 
1 kHz 


Signal-to-noise ratio 60 dB >90 dB 
| Harmonic distortion 1%-2% 0.005% 


Durability High frequency response Permanent 
degrades with playing 


Stylus life 500-600 hours 5000 hours 











Digital audio recording and playback allows us to improve the fidelity of recorded 
music by increasing the dynamic range and the signal-to-noise ratio. Furthermore, digi- 
tal recordings are generally more durable and do not deteriorate with playing time, as do 
analog recordings. Next, we will describe a CD system as an example of a commercially 
successful digital audio system that was introduced in 1982. Table 7.4 provides a compari- 
son of some important specifications of an LP record and a CD system. The advantages of 
the latter are clearly evident. 

From a systems point of view, the CD system embodies most of the elements of a 
modern digital communication system. These include analog-to-digital (A/D) and digital- 
to-analog (D/A) conversion, interpolation, modulation/demodulation, and channel cod- 
ing/decoding. A general block diagram of the elements of a CD digital audio system is 
illustrated in Figure 7.21. Now, we will describe the main features of the source encoder 
and decoder. 

The two audio signals from the left (L) and right (R) microphones in a recording 
studio or a concert hall are sampled and digitized by passing them through an A/D con- 
verter. Recall that the frequency band of audible sound is limited to approximately 20 kHz. 
Therefore, the corresponding Nyquist sampling rate is 40 kHz. To allow for some fre- 
quency guard band and to prevent aliasing, the sampling rate in a CD system has been 
selected to be 44.1 kHz. This frequency is compatible with video recording equipment that 
is commonly used for the digital recording of audio signals on magnetic tape. 

The samples of both the L and R signals are quantized using uniform PCM with 
16 bits/sample. According to the formula for SQNR given by Equation (7.4.4), 16-bit uni- 
form quantization results in an SQNR of over 90 dB. In addition, the total harmonic distor- 
tion achieved is 0.005%. The PCM bytes from the digital recorder are encoded to provide 
protection against channel errors in the readback process and passed to the modulator. 

At the modulator, digital control and display information is added, including a table 
of contents of the disc. This information allows for programmability of the CD player. 
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system. 
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Using a laser, the digital signal from the modulator is optically recorded on the sur- 
face of a glass disc that is coated with photoresist. This results in a master disc, which is 
used to produce CDs by a series of processes that ultimately convert the information into 
tiny pits on the plastic disc. The disc is coated with a reflective aluminum coating and then 
with a protective lacquer. 

In the CD player, a laser is used to optically scan a track on the ‘disc at a constant 
velocity of 1.25 m/sec and, thus, to read the digitally recorded signal. After the LandR 
signal are demodulated and passed through the channel decoder, the digital audio signal is 
converted back to an analog audio signal by means of a D/A converter. 

The conversion of L and R digital audio signals into the D/A converter has a precision 
of 16 bits. In principle, the digital-to-analog conversion of the two 16-bit signals at the 
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Figure 7.22 Oversampling and digital filtering. 


44.1 kHz sampling rate is relatively simple. However, the practical implementation of a 
16-bit D/A converter is very expensive. On the other hand, inexpensive D/A converters 
with 12-bit (or less) precision are readily available. The problem is to devise a method for 
D/A conversion that employs low precision and, hence, results in a low-cost D/A converter, 
while maintaining the 16-bit precision of the digital audio signal. 

The practical solution to this problem is to expand the bandwidth of the digital audio 
signal by oversampling through interpolation and digital filtering prior to analog conver- 
sion. The basic approach is shown in the block diagram given in Figure 7.22. The 16-bit L 
and R digital audio signals are up-sampled by some multiple U by inserting U — 1 zeros 
between successive 16-bit signal samples. This process effectively increases the sampling 
rate to U x 44.1 kHz. The high-rate L and R signals are then filtered by a finite-duration 
impulse response (FIR) digital filter, which produces a high-rate, high-precision output. 
The combination of up-sampling and filtering is a practical method for realizing a digital 
interpolator. The FIR filter is designed to have a linear phase and a bandwidth of approx- 
imately 20 kHz. It serves the purpose of eliminating the spectral images created by the 
up-sampling process and is sometimes called an antiimaging filter. 

If we observe the high sample rate and high precision of the L and R digital audio 
signals of the output of the FIR digital filter, we will find that successive samples are nearly 
the same; they differ only in the low-order bits. Consequently, it is possible to represent 
successive samples of the digital audio signals by their differences and, thus, to reduce 
the dynamic range of the signals. If the oversampling factor U is sufficiently large, delta 
modulation may be employed to reduce the quantized output to a precision of 1 bit/sample. 
Thus, the D/A converter is considerably simplified. An oversampling factor U = 256 is 
normally chosen in practice. This raises the sampling rate to 11.2896 MHz. 

Recall that the general configuration for the conventional delta modulation system is 
as shown in Figure 7.23. Suppose we move the integrator from the decoder to the input of 
the delta modulator. This has two effects. First, it preemphasizes the low frequencies in the 
input signal; thus, it increases the correlation of the signal into the delta modulator. Second, 
it simplifies the delta modulator decoder because the differentiator (the inverse system; 
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Figure 7.24 A sigma-delta modulator. 


required at the decoder is canceled by the integrator. Hence, the decoder is reduced to a 
simple lowpass filter. Furthermore, the two integrators at the encoder can be replaced by a 
single integrator placed before the quantizer. The resulting system, shown in Figure 7.24, 
is called a sigma-delta modulator (SDM). Figure 7.25 illustrates an SDM that employs a 
single digital integrator (first-order SDM) with a system function 


A(z) = ———.. 
@) 1-—z! 

Thus, the SDM simplifies the D/A conversion process by requiring only a 1-bit D/A fol- 
lowed by a gonventional analog filter (a Butterworth filter, for example) for providing 
antialiasing protection and signal smoothing. The output analog filters have a passband 
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Figure 7.25 A first-order sigma-delta modulator. 
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of approximately 20 kHz; thus, they eliminate any noise above the desired signal band. In 
modem CD players, the interpolator, the SDM, the 1-bit D/A converter, and the lowpass 
smoothing filter are generally implemented on a single integrated chip. 


7.7 THE JPEG IMAGE-CODING STANDARD 


The JPEG standard, adopted by the Joint Photographic Experts Group, is a widely used 
standard for lossy compression of still images. Although several standards for image com- 
pression exist, JPEG is by far the most widely accepted. The JPEG standard achieves very 
good-to-excellent image quality and is applicable to both color and gray-scale images. 
The standard is also rather easy to implement and can be implemented in software with 
acceptable computational complexity. 

JPEG belongs to the class of wansform-coding techniques, i.e., coding techniques 
that do not compress the signal (in this case, an image signal) directly, but compress the 
transform of it. The most widely used transform technique in image coding is DCT (dis- 
crete cosine transform). The major benefits of DCT are its high degree of energy com- 
paction properties and the availability of a fast algorithm for computation of the transform. 
The energy compaction property of the DCT results in transform coefficients, in which 
only a few of them have significant values, so that nearly all of the energy is contained in 
those particular components. 

The DCT of an N x N picture with luminance function x(m, n),0 < mn<N-1 
can be obtained with the use of the following equations: 


N-1N-1 


X00) =— YD, 7.1) 


k=0 [=0 


N-1N-1 
X(u, v) = >> Sox 1) cos [Ss] cos IS 


k=0 1=0 


| u,v £0. (7.7.2) 


The X (0, 0) coefficient is usually called the DC component, and the other coefficients are 
called the AC components. 

The JPEG encoder consists of three blocks: the DCT component, the quantizer, anc 
the encoder, as shown in Figure 7.26. 


The DCT Component. A picture consists of many pixels arranged in an m xr 
array. The first step in DCT transformation of the image is to divide the picture array int 
8 x 8 subarrays. This size of the subarrays has been chosen as a compromise of complexity 
and quality. In some other standards, 4 x 4 or 16 x 16 subarrays are chosen. If the number o 
rows or columns (m or n) is not a multiple of 8, then the last row (or column) is replicatec 
to make it a multiple of 8. The replications are removed at the decoder. l 

After generating the subarrays, the DCT of each subarray is computed. This proces 
generates 64 DCT coefficients for each subarray starting from the DC component X (0, 0 
and going up to X (7, 7). The process is shown in Figure 7.27. 
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Figure 7.26 The block diagram of a JPEG encoder and decoder. 
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Figure 7.27 The DCT transformation in JPEG. 
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TABLE 7.5 QUANTIZATION TABLE FOR JPEG 


61 
55 
56 
62 
77 
92 





101 








The Quantizer. Due to the energy compaction property of the DCT, only low- 
frequency components of the DCT coefficients have significant values. 

Since the DC component carries most of the energy and since there exists a strong 
correlation between the DC component of a subarray and the DC component of the preced- 
ing subarray, a uniform differential quantization scheme is employed for quantization of 
DC components. The AC components are quantized using a uniform quantization scheme. 
Although all components are quantized using a uniform scheme, different uniform quanti- 
zation schemes use different step sizes. All quantizers, however, have the same number of 
quantization regions, namely, 256. 

A 64-element quantization table determines the step size for uniform quantization of 
each DCT component. These step sizes are obtained using psychovisual experiments. The 
result of the quantization step is an 8 x 8 array with nonzero elements only in the top left 
comer and many zero elements in other locations. A sample quantization table illustrating 
the quantization steps for different coefficients is shown in Table 7.5. 

After the quantization process, the quantized DCT coefficients are arranged in a vec- 
tor by zigzag sampling, as shown in Figure 7.28. Using this type of sampling, we obtain < 
vector X of length 64 with nonzero values only in the first few components. 


The Encoding. The quantization step provides lossy compression of the imag 
using the method described previously. After this step, entropy coding (as will be discussex 
in Chapter 12) is employed to provide lossless compression of the quantized values. Oni 
of the entropy-coding methods specified in the JPEG standard is Huffman coding; this wil 
be discussed in Section 12.3.1. In this case, Huffman codes are based on tables specifyin; 
code words for different amplitudes. Since the quantized subarrays contain a large numbe 
of zeros, some form of runlength coding is used to compress these zeros. Refer to th 
references at the end of this chapter for further details. 


Compression and Picture Quality in JPEG. Depending on the rate, JPE( 
can achieve high-compression ratios with moderate-to-excellent image quality for bot 
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eo 


ye Y Figure 7.28 Zigzag sampling of the 


AC35 ACe3 DCT coefficients. 


gray-scale and color images. At rates of 0.2-0.5 bits/pixel, moderate-to-good quality 
pictures can be obtained that are sufficient for some applications. Increasing the rate to 
0.5-0.75 bits/pixel results in good-to-very“good quality images that are sufficient for many 
applications. At 0.75-1.5 bits/pixel, excellent quality images are obtained that are sufficient 
for most applications. Finally, at rates of 1.5-2 bits/pixel, the resulting image is practi- 
cally indistinguishable from the original. These rates are sufficient for the most demanding 
applications. 





7.8 SUMMARY AND FURTHER READING 


The focus of this chapter was on the conversion of analog signals to digital form. We 
began by describing the sampling theorem for bandlimited signals. We demonstrated that 
by sampling an analog signal with bandwidth W at the minimum rate of 2W samples per 
second, it is possible to reconstruct the analog signal from its samples with no loss in 
fidelity or information. This minimum sampling rate of 2W samples per second is called 
the Nyquist rate. 

The second step in the conversion of an analog signal to digital form is quantization 
of the samples to a set of discrete amplitude levels. The simplest form of quantization is 
scalar quantization, where each sample is quantized separately. A scalar quantizer can per- 
form either uniform quantization or nonuniform quantization. We described both methods 
and characterized the performance of a uniform quantizer and a nonuniform quantizer in 
terms of the signal-to-quantization-noise ratio (SQNR). We also described vector quanti- 
zation, in which a block of k samples is jointly quantized. In general, vector quantization 
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- results in superior performance compared to scalar quantization, and is widely. used in 
speech and image digital signal processing. 
The third and final step in the conversion of an analog signal to digital form is encod- 
ing. In the encoding process, a sequence of bits is assigned to different quantization values 
We also described several waveform coding schemes which are designed to repro- 
duce the output waveform from a source at the destination with as little distortion as possi- 
_ ble. These methods include both uniform and nonuniform pulse code modulation (PCM) 
differential pulse code modulation (DPCM), delta modulation (DM), and adaptive delt: 
modulation (ADM). Another waveform encoding method is based on construction of ¢ 
model for the analog source and using linear prediction to estimate the model parameters 
which are transmitted to the receiver. In turn, the receiver uses the model parameters tc 
reconstruct the source and generate the source output. This method is called an analysis- 
synthesis technique, where the analysis is performed at the transmitter to estimate the 
| model parameters and the synthesis is performed at the receiver to construct the mode 
iji for the source and generate the source output. This technique is widely used in speecl 
coding. 

In the last two sections of this chapter, we presented applications of analog-to-digita 
conversion in digital audio transmission in telephone systems, digital audio recording fo 
the compact disc, and image coding based on the JPEG standard. 

Jayant and Noll (1984) and Gersho and Gray (1992) examine various E 
and waveform coding techniques in detail. Gersho and Gray (1992) include a detailed treat 
ment of vector quantization. Analysis—synthesis techniques and linear-predictive codin; 
| are treated in books on speech coding, specifically Markel and Gray (1976), Rabiner anı 
Schafer (1979), and Deller, Proakis, and Hansen (2000). The JPEG standard is describe 
in detail in the book by Gibson, et al. (1998). 
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PROBLEMS 








| 7.1 Assume x(t) has a bandwidth of 40 kHz. 





1. What is the minimum sampling rate for this signal? - 
2. What is the minimum sampling rate if a guard band of 10 kHz is required? 


3. What is the maximum sampling interval for the signal x(t): 
x(t) cos(80,000x1t)? 





7.2 For a lowpass signal with a bandwidth of 6000 Hz, what is the minimum samplin 
frequency for perfect reconstruction of the signal? What is the minimum require 
sampling frequency if a guard band of 2000 Hz is required? If the reconstructic 
filter has the frequency response 


K |f| < 7000 
H(f)=)K—K'£™ 7000 < |f| < 10,000 , 
0 otherwise 
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what is the minimum required sampling frequency and the value of K for perfect 
reconstruction? 


7.3 Let the signal x(t) = Asinc(1000r) be sampled with a sampling frequency of 2000 
samples/sec. Determine the most general class of reconstruction filters for perfect 
reconstruction of this signal. 


7.4 The lowpass signal x(t) with a bandwidth of W is sampled with a sampling interval 
of T;, and the signal 


oo 


xp(t)= ` x(Ts)p(t —nTs) 


n=—OO 


is reconstructed from the samples, where p(t) is an arbiwary-shaped pulse (not nec- 
essarily time limited to the interval [0, T;]). 


1. Find the Fourier wansform of x, (ft). 


KD 


2. Find the conditions for perfect reconstruction o fa) for. x p KON 


3. Determine the required reconstruction filter. S 


7.5 The lowpass signal x(t) with a bandwidth of W is sampled at the Nyquist rate, and 
the signal 


m(t) = X (-1"x(n%)8(¢ —nT;) 


is generated. 


£ 


1. Find the Fourier transform of x (t). 


2. Can x(t) be reconstructed from x; (t) by using a linear time-invariant system? 
Why? 


3. Can x(t) be reconstructed from x(t) by using a linear time-varying system? 
How? 


7.6 A lowpass signal x(t) with bandwidth W is sampled with a sampling interval T;, and 
the sampled values are denoted by x (nT;). A new signal x, (t) is generated by linear 
interpolation of the ja values, i.e., 








x(t) = x(nTs) +! T an +1)Ts)—x(nTs)) nT; StS (n+ DTI. 


TERRES AE OED PPY ETARE RSE O T TEA ARREN E 


1. Find the power spectrum of x; (t). 





2. Under what conditions can the original signal be reconstructed from the sampled 
signal and what is the required reconstruction filter? 
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7.7 A lowpass signal x(t) with bandwidth of 50 Hz is sampled at the Nyquist rate and 
the resulting sampled values are 
-1 -4<n<0 
x(nT;) = 1 O<n<4 
0 otherwise 


1. Find x(.005). 


2. Is this signal power type or energy type? Find its power or energy content. 
78 Let W be arbiwary and x(t) be a lowpass signal with a bandwidth W. j 


1. Show that the set of signals {¢,,(t)}72_,,, Where ¢, = sinc(2Wt — n) represents 
an orthogonal signal set. How should these signals be weighted to generate an 
orthonormal set? 


2. Conclude that the reconstruction from the samples relation 


x(t) = > x(nT,)sinc(2Wt — n) 


n=—00 
is an orthogonal expansion relation. 
3. From Part 2, show that for all n 


ie x(t)sinc(2Wt — n) dt = Kx(nT,), 


foe) 


and find K. 


N, 
E TA 


7.9 Let X (t) denote a wide-sense stationary (WSS) Gaussian process with Py = 10. 


© 


1. Using Table 7.1, design a 16-level optimal uniform quantizer for this source. 
2. What is the resulting distortion if the quantizer in Part 1 is employed? 


3. What is the amount of improvement in SQNR (in decibels) that results from 
doubling the number of quantization levels from 8 to 16? 


7.10 Using Table 7.1, design an optimal quantizer for the source given in Example 7.2.1. 
Compare the distortion of this quantizer to the distortion obtained there. 


7.11 Solve Problem 7.9 by using Table 7.2 instead of Table 7.1 to design an optimal 
nonuniform quantizer for the Gaussian source. 


7.12 Consider the encoding of the two random variables X and Y, which are uniformly 
distributed on the region between the two squares, as shown in Figure P-7.12. 
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Figure P-7.12 


1. Find fx(x) and fy(y). 


2. Assume each of the random variables X and Y are quantized using four-level 
uniform quantizers. What is the resulting distortion? What is the resulting num- 
ber of bits per (X, Y) pair? 


3. Now assume that instead of scalar quantizers for X and Y, we employ a vector 
quantizer to achieve the same level of distortion as in Part 2. What is the resulting 
number of bits/source output pair (X, Y)? 


7.13 Two random variables X and Y are uniformly distributed on the square shown in 
Figure P-7.13. 





Figure P-7.13 
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1. Find fx(x) and fy(y). 

2. Assume that each of the random variables X and Y are quantized using four- 
level uniform quantizers. What is the resulting distortion? What is the resus? 
number of bits per (X, Y) pair? 


3. Now assume that instead of scalar quantizers for X and Y, we employ a vector 


quantizer with the same number of bits/source output pair (X, Y) as in Part 2. 
What is the resulting distortion for this vector quantizer? 


7.14 Solve Example 7.4.1 when the samples are uniformly distributed on [—2, 2]. 


7.15 A stationary random process has an autocorrelation function given by Rx(t) = 
2 _ r 
£ emri cos 21 fot; we know that the random process never exceeds 6 in magnitude. 
Assume that A = 6. 


1. How many quantization levels are required to guarantee an SQNR of at least 
60 dB? 

2. Assuming that the signal is quantized to satisfy the condition of Part 1 and 
assuming the approximate bandwidth of the signal is W, what is the minimum 
required bandwidth for the transmission of a binary PCM signal based on this 

rc quantization scheme? 


x 7.16.,A signal can be modeled as a lowpass stationary process X(t), whose probability 
Shu density function (PDF) at any time to is shown in Figure P-7.16. 


Fic(x) į 


Figure P-7.16 


The bandwidth of this process is 5 kHz, and we desire to transmit it using a PCM 
system. 


1. If sampling is done at the Nyquist rate and a uniform quantizer with 32 levels is 
employed, what is the resulting SQNR? What is the resulting bit rate? 


2. If the available bandwidth of the channel is 40 kHz, what is the highest achiev- 
able SQNR? 

3. If, instead of sampling at the Nyquist rate, we require a guard band of at leas: 
2 kHz and the bandwidth of the channel is 40 kHz, what is the highest achievable 
SQNR? 
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7, 17 A-Stationary source is distributed according to a triangular PDF, fx(x) = tA ( 5): 
/This source is quantized using the four-level uniform quantizer 


1.5, l<x<2 
0.5, O<x<l 
—0.5, -l<x<0 
—1.5, —2<x<-l 


Q (x) = 


Determine the PDF of the random variable representing the quantization error, i.e., 
X =X-Q(X). 


7.18 The random process X (t) is defined by X (t) = Y cos(2nfot+0), where Y and © are 
two independent random variables, Y uniform on [—3, 3] and © uniform on [0, 27]. 


1. Find the autocorrelation function of X(t) and its power spectral density. 


2. If X(t) is to be transmitted to maintain an SQNR of at least 40 dB using a uni- 
form PCM system, what is the required number of bits/sample and the least 
bandwidth requirement (in terms of fo)? 


3. If the SQNR is to be increased by 24dB, how many more bits/sample must be 
introduced, and what is the new minimum bandwidth requirement in this case? 


7.19 A zero-mean, WSS random process X (t) with autocorrelation function of 
Rx(t) = 2sinc? (104r) 


is applied to a PCM system. We also know that X(t) has a maximum value of 10. 
The quantizer in the PCM system uses N = 128 levels. 


1. If a guard band of 2.5 kHz is used for sampling, what is the resulting bit rate 
(R), SQNR (in dB), and the minimum required transmission bandwidth (Br)? 

2. If the minimum required SQNR is 56 dB, what is the required N and the mini- 
mum required wansmission bandwidth (assuming the same 2.5 kHz guard band 
as in part 1)? What will be the resulting SQNR (in dB) of the designed system? 

3. If a total wansmission bandwidth of 93 kHz is available, what is the highest 


achievable SQNR (in dB)? What are the resulting system parameters (Wg, N 
and f,) and the transmission rate? 


7.20 X (t) denotes a zero-mean WSS Gaussian random process with autocorrelation func- 
tion 


Rx(t) = 4sinc?(10*r) 
1. What is the power in this process? 


2. Determine the power spectral density, S,(f), for this process. 


3. What is the bandwidth of this process? 
» 
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4, Assuming that this process passes through an ideal lowpass filter with 2 band- 
width of 5 kHz and the output is denoted by Y(t), determine S,(f), the power 
spectral density of Y(t), and the total power in the output process. 


5. Determine the PDF (probability density function) of random variables X (0), 
X (107), and X (1.5 x 1074). 


6. Show that random variables X (0) and X(10~*) are independent but X (0) and 
X(1.5 x 1074) are dependent. 


7.21 In a PCM system, the signal X (t) has a bandwidth of 5 kHz. This signal is sampled 
is with a guard band of 2 kHz. It is known that the sampled values have a probability 
density function shown below. 


| Fx) 
| 


ale wl 


2 





The quantizer uses N = 128 levels. After quantization, the quantized levels are 
binary encoded and transmitted. 


1. What is the resulting bit rate in bits per second? 

2. What is the resulting SQNR (in dB)? 

3. What is the required transmission bandwidth? 

4. If the available transmission bandwidth is 70 kHz, what is the maximum achiev- 
able SQNR (in dB)? 


7.22 The power spectral density of a WSS information source is shown below (the unit of 
power spectral density is W/Hz). The maximum amplitude of this signal is 200. 





f(z) 


-3000 -2000 2000 3000 
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7.23 


7.24 





1. What is the power in this process? 


2. Assume that-this signal is wansmitted using a uniform PCM system with 512 
quantization levels, what is the resulting SQNR in decibels and what is the min- 
imum required transmission bandwidth if in sampling the signal a guard band of 
1 kHz is used? 


3. If the available transmission bandwidth is 47 kHz, design a PCM system that 
achieves the highest possible SQNR and utilizes the maximum guard band. 


A zero-mean stationary information source X (t) has a power spectral density given 
by 


1 1 

———; [fl <200Hz 
Sx(f) = wit f 4 

0 otherwise 


The amplitude of this source is limited to 10 in magnitude. This source is sampled 
at the Nyquist rate and the samples are coded using an 8 bit/sample uniform PCM 
system. 


1. Determine the resulting SQNR in decibels. 


2. If we want to increase the SQNR by at least 20 dB, how should the required 
number of quantization levels change? 


3. In Part 2, what is the minimum required bandwidth for the transmission of the 
PCM signal? 


(Hint: 4 arctan x = 
x/2.) 


TE andforx > 20, you can use the approximation arctan x ~% 


Signal X (t) has a bandwidth of 12,000 Hz, and its amplitude at any time is a random 
variable whose PDF is shown in Figure P-7.24. We want to transmit this signal using 
a uniform PCM system. 


Figure P-7.24 


Show that a = i. 
Determine the power in X (t). 
What is the SQNR in decibels if a PCM system with 32 levels is employed? 


What is the minimum required transmission bandwidth in Part 3? 


ae pe 
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If we need to increase the SQNR by at least 20 dB, by how much should the 
transmission bandwidth increase? 


7.25 The power spectral density of a zero-mean WSS random process X (t) is given by 


5000 
LEL —5000 < f <0 
5000 
Sx(f) = { —f + 5000 0< f <5000 ° 
5000 
0 otherwise 


and the maximum amplitude of this process is 600. 


Un 


AN 
p 
va 
/ 


. Whatis the power content ofthis process? 


. If this process is sampled at rate f; to guarantee a guard band of 2000-Hz, what 


is fs? 


. If we use a PCM system with 256 quantization levels on this process (sampled 


at the rate you found in Part 2), what is the resulting SQNR (in decibels)? 


. In Part 3, what is the resulting bit rate? 
. If the output of the PCM system is to be transmitted using a binary system, what 


is the required minimum transmission bandwidth? 


. If we need to increase the SQNR by at least 25 dB, what is the required num- 


ber of quantization levels, the resulting SQNR, and the required transmission 
bandwidth? 


is In our analysis of PCM systems, we always assumed that the transmitted bits were 


~ received with no errors. However, practical channels cause errors. Let us assume that 
the output of a PCM system is transmitted via a channel whose error probability is 
denoted by P2. We further assume that P, is small enough such that, in transmission 
of the v bits resulting from encoding of each quantized sample, either no error occurs 
or, at most, one error occurs. This means that the probability of each transmitted bit 
being in error is P2, and the probability of no error in transmission of v bits is roughly 
1 — vP,. We also assume that for the binary representation of each quantized value, 
natural binary coding (NBC) is employed, i.e., the lowest quantized level is mapped 
into a sequence of zeros and the largest level is mapped into a sequence of all ones, 
and all the other levels are mapped according to their relative value. 


1. 


Show that, if an error occurs in the least significant bit, its effect on the quan- 
tized value is equivalent to A, the spacing between the levels; if an error occurs 
in the next bit, its effect on the quantized value is 2A; ..., if an error occurs in 
the most significant bit, its effect on the quantized value is 2”7! A. 


. From Part 1, show that the mean squared error resulting from channel errors is 


given by i 
24-1 
Denannel = P2A T3 ’ 





where A = Sen = aa is the spacing between adjacent levels. 
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3. From Part 2, conclude that the total distortion, which is the sum of the quanti- 
zation distortion and the transmission distortion due to channel errors, can be 
expressed by 





2 

x 2 x 

Doai = raya (1 + 4Po(N? — 1) = PRL APR — 1). 

4. Finally, show that the SNR, defined as the ratio of the signal power to the total 


noise power, is given by 


32x? 3 x 4X? 
~ 14+4P)(N2—1) 144P,(4"—1)’ 
where 
Pai 
Xmax 


7.27 In a CD player, the sampling rate is 44.1 kHz, and the samples are quantized using a 
16 bit/sample quantizer. Determine the resulting number of bits for a piece of music 
with a duration of 50 minutes. 





‘COMPUTER PROBLEMS 


7.1 Determining the Centroids 


Use MATLAB to determine the centroids of the quantization regions for a zero- 
mean, unit-variance Gaussian distribution, where the boundaries of the quantization 
regions are given by (—5, —4, —2, 0, 1, 3, 5). The Gaussian distribution is given in 
the m-file normal .m. Although the support of the Gaussian distribution is (—oo, oo), 
for practical purposes, it is sufficient to use a range that is many times the standard 
deviation of the distribution. For example, (m — 100, m + 100), where m is the mean 
and ø is the standard deviation (ø? is the variance) of the Gaussian random variable. 


7.2 Uniform Quantizer Distortion 


The objective of this problem is to use MATLAB to determine the mean squared error 
for a uniform quantizer with 12 quantization levels, each of length 1, designed for 
a zero-mean Gaussian source with o? = 4. The quantization regions are symmetric 
with respect to the mean of the distribution. 


1. Specify the boundaries of the quantization regions. 

2. Specify the 12 quantization regions. 

3. Determine the 12 quantization values corresponding to the quantization regions 
and the resulting mean squared distortion. 


7.3 Design of Lloyd—Max Quantizer 


The objective of this problem is to use MATLAB to design a 10-level Lloyd—Max 
(nonuniform) quantizer for a zero-mean, unit-variance Gaussian source. 
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1. Determine the quantization boundaries and the quantization levels. 


2. Determine the mean squared distortion. 


7.4 Uniform PCM 


The objective of this exercise is to investigate the error in the quantization of a sinu- 
soidal signal using uniform PCM. 


1. Quantize the sinusoidal signal 
sit) =sint, O0<t<10 


once to 8 levels and once to 16 levels for a sampling interval Ts = 0.1. Plot the 
original signal and the two quantized versions on the same graph, and observe 
the results. j 


2. Compute the SQNR for the 8- and 16-level quantizers. 


7.5 Quantization Error in Uniform PCM 


The objective of this problem is to evaluate the quantization error in quantizing < 
Gaussian source using uniform PCM. 


1. Generate 500 zero-mean, unit-variance Gaussian random variables and quantize 
them by using a uniform 64-level PCM quantizer and encoder. Plot the 500-poin 
sequence generated. 


2. Determine the SQNR for the 64-level quantizer. 


3. Determine the first five values of the sequence, the corresponding quantized val 
ues, and the corresponding PCM code words. 


4. Plot the quantization error, defined as the difference between the input value an 
the quantized value, for the 500-point sequence. 


7.6 Nonuniform PCM 


The objective of this problem is to evaluate the quantization error in quantizing thi 
output of a Gaussian source with nonuniform PCM. 


1. Generate 500 zero-mean, unit-variance Gaussian random variables and quantiz 
them using a 16-, 64-, and 128-level quantizer and a u = 255 nonlinearity. Plc 
the input-output quantizer characteristic for each case -and the correspondin; 
error sequence. 


2. Determine the SQNR for each quantizer. 
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In Chapter 7, we described methods for converting the output of a signal source into a 
sequence of binary digits. In this chapter, we consider the transmission of the digital infor- 
mation sequence over communication channels that are characterized as additive white 
Gaussian noise (AWGN) channels. The AWGN channel is one of the simplest mathemat- 
ical models for varjous physical communication channels, including wirelines and some 
radio channels. Sych channels are basically analog channels, which means that the digital 
information sequénce to be transmitted must be mapped into analog signal waveforms. 

Our treatment focuses on the characterization and the design of analog signal wave- 
forms that carry digital information and their performance on an AWGN channel. First, we 
consider signal transmission through baseband channels, i.e., channels having frequency 
passbands that usually include zero frequency (f = 0). When the digital information is 
transmitted through a bas channel, there is no need to use a carrier frequency for the 
transmission of the digitally modulated signals. On the other hand, there are many com- 
munication channels (including telephone channels, radio channels, and satellite channels) 
that have frequency passbands that are far removed from f = 0. These types of chan- 
nels are called bandpass channels. In such channels, the information-bearing signal is 
impressed on a sinusoidal carrier, which shifts the frequency content of the information- 
bearing signal to the appropriate frequency band that is passed by the channel. Thus, the 
signal is transmitted by carrier modulation. 

Throughout this chapter, we study only one-shot communications, i.e., transmission 
of only one signal corresponding to a single message, with no transmission following this 
single transmission. We study sequential transmission of digital data and intersymbol inter- 
ference in Chapter 10. 
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We begin by developing a geometric representation of these types of signals, which 
is useful in assessing their performance characteristics. Then, we describe several differ- 
ent types of analog signal waveforms for transmitting digital information, and we give thei 
geometric representation. The optimum demodulation and detection of these signals is then 
described, and their performance in the AWGN channel is evaluated in terms of the prob- 
ability of error. In Chapter 9, we compare the various modulation methods on the basis ot 
their performance characteristics, their bandwidth requirements, and their implementatior 
complexity. 


8.1 GEOMETRIC REPRESENTATION OF SIGNAL WAVEFORMS 


In a digital communication system, the modulator input is typically a sequence of binary 
information digits. The modulator may map each information bit to be transmitted into on 
of two possible distinct signal waveforms, say sı (t) or s2(t). Thus, a zero is represented b' 
the transmitted signal waveform s;(t), and a one is represented by the transmitted signa 
waveform s2(t). This type of digital modulation is called binary modulation. Alternatively 
the modulator may transmit k bits (k > 1) at atime by employing M = 2* distinct signa 
waveforms, say Sm(t), 1 < m < M. This type of digital modulation is called M-ar 
(nonbinary) modulation. In this section, we develop a vector representation of such digita 
signal waveforms. Such a representation provides a compact characterization of signa 
sets for transmitting digital information over a channel, and it simplifies the analysis o 
their performance. Using vector representation, waveform communication channels ar 
represented by vector channels. This reduces the complexity of analysis considerably. 

Suppose we have a set of M signal waveforms s,,(t), 1 < m < M, which are to b 
used for transmitting information over a communication channel. From the set of M wave 
forms, we first construct a set of N < M orthonormal waveforms, where N is the dimer 
sion of the signal space. For this purpose, we use the Gram-Schmidt orthogonalizatio 
procedure. 

Gram-Schmidt Orthogonalization Procedure. We begin with the first wavefor 
sı (t), which is assumed to have energy 1. The first waveform of the orthonormal set : 
constructed simply as 





si(t) 
Ve 
Thus, y(t) is simply s,(¢) normalized to unit energy. 


The second waveform is constructed from s2(t) by first computing the projection « 
$2(t) onto W(t), which is 


WO = (8.1. 





C21 = i So(t) vi (t)dt. (8.1.: 


Then, c21¥1(t) is subtracted from s2(t) to yield 


dz(t) = s2(t) — ca Wilt). (8.1. 
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Now, d2(t) is orthogonal to y (t), but it does not possess unit energy. If 62 denotes the 
energy in d2(t), then the energy-normalized waveform that is orthogonal to y (t) is 


a(t) 


t) = — =, 8.1.4 
Yr(t) JE (8.1.4) 
o0 
b2 = l d3(t) dt. (8.1.5) 
—0o 
In general, the orthogonalization of the kth function leads to 
d,(t) 
th = =, 8.1.6 
Vi Je (8.1.6) 
where 

k-1 
de(t) = (t) — D> cuwil), (8.1.7) 

i=l 

[o.e} 
C= f d(t)dt, (8.1.8) 
-œ 
and 
o0 
Chi =f sOi (dt, i=1,2,..k—-L. (8.1.9) 
—00 


Thus, the orthogonalization process is continued until all the M signal waveforms {Sm(t)} 
have been exhausted and N < M orthonormal waveforms have been constructed. If at any 
step d,(t) = 0, then there will be no new y(t); hence, no new dimension is introduced. 
The N orthonormal waveforms {y,,(t)} form an orthonormal basis in the N-dimensional 
signal space. The dimensionality N of the signal space will be equal to M if all the M 
signal waveforms are linearly independent, i.e., if none of the signal waveforms is a linear 
combination of the other signal waveforms. 


Example 8.1.1 
Let us apply the Gram-Schmidt procedure to the set of four waveforms illustrated in Fig- 
ure 8.1(a). The waveform sı(t) has energy 6, = 2, so that y(t) = 51(t)//2. Next, we 
observe that c21 = 0, so that Y; (t) and s2(t) are orthogonal. Therefore, w(t) = s2(t)/ VEA = 
sa(t)/ /2.Toobtain wW3(t), we compute c3; and c32, whicharec3; = 0 and c32 = —4/2. Hence, 


dt) = 53(t) + V2yn(2). 


Since d;(t) has unit energy, it follows that #3(t) = d3(t). Finally, we find that c41 = v32, 
cy = 0, C43 = 1. Hence, 


dalt) = sa(t) — V24 (2) — a(t) = 0. 
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s(t) 4} 53(2) | 





(a) Original signal set 


ww) + p(t) rn 








(b) Orthonormal waveforms 


Figure 8.1 Application of the Gram-Schmidt orthogonalization procedure to signals {s;(t)}. 


Thus, s4(t) is a linear combination of yı (t) and w3(t); consequently, the dimensionalityfof the 
signal set is N = 3. The functions Y7 (t), Y(t), and y3(t) are shown in Figure 8.1(b) ‘a 
ress 


Once we have constructed the set of orthogonal waveforms {Wn (t)}, we can exp 
the M signals {Sm(t)} as exact linear combinations of the (Hm). Hence, we may "S 


N 
Smt) = Y Smalt),  m=1,2,...M, (8.1.10) 


n=1 


where the weighting coefficients in this linear combination are given as 


Smn -f Sm(t) Wn (t) dt. (8.1.11) 


foe) 
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Since the basis functions {y,,(¢)} are orthonormal, the energy of each signal waveform is 
related to the weighting coefficients as follows: 


3 N 
Em = J s? (t) dt = Sa: 
i m 2 m: 


On the basis of expression in Equation (8.1.10), each signal waveform may be rep- 
resented by the vector 


Sm = (Smi, Sm2s+++5SmN)s (8.1.12) 
or, equivalently, as a point in N-dimensional signal space with coordinates {Smi,i = 


1,2, ..., N}. We can show that the inner product of two signals is equal to the inner product 
of their vector representations, i.e., 


foe) 
f Sm(t)Sn(t) dt = Sm'Sn- (8.1.13) 
— o0 
and, in particular, when we substitute m = n in Equation (8.1.13), we obtain 
[o) 
En = / s? (®©) dt = |lSmll?; (8.1.14) 
=O 


i.e., the energy of the mth signal waveform is simply the square of the length of the vector 
or, equivalently, the square of the Euclidean distance from the origin to the point in the 
N-dimensional space. It should also be noted that since selection of the orthonormal basis 
is not unique, the vector representation of signal waveforms is not unique either. However, 
the dimensionality of the the signal space N and the length of the vectors representing the 
signals are independent of the selected orthonormal basis. 


Example 8.1.2 
Let us determine the vector representations of the four signals shown in Figure 8.1(a) by using 
the orthonormal set of functions in Figure 8.1(b). Since the dimensionality of the signal space 
is N = 3, each signal is described by three components, which are obtained by projecting each 
of the four signal waveforms on the three orthonormal basis functions y(t), w2(t), W(t). 
Thus, we obtain sı = (/2,0, 0), s2 = (0, V2, 0), s3 = (0, —v/2, 1), s4 = (/2, 0, 1). These 
signal vectors are shown in Figure 8.2. a 


Y(t) 





Figure 8.2 Signal vectors corresponding to the signals 
W(t) s(t), i = 1,2,3,4. 
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MIOR w(t) 4 





Figure 8.3 Alternate set of basis functions. 







Finally, we should observe that the set of basis functions {w,,(t)} obtained by th: 
—Schmidt procedure is not unique. For example, another set of basis functions tha 
the three-dimensional space is shown in Figure 8.3. For this basis, the signal-vector, 


Gra 


the léngths (energies) of the signal vectors, or the inner product of any two vectors. £ 
changen the basis is essentially a rotation and/or reflection of the signal points around th 
origin. 

Although the Gram-Schmidt orthogonalization procedure is guaranteed to generat 
an orthonormal basis for representation of the signal set, in many cases, including the pre 
ceding example, it is easier to use a method based on inspection to generate the orthonorma 
basis. We explore this method in the problems at the end of this chapter. 


8.2 BINARY MODULATION SCHEMES 


In this section, we consider two different binary modulation methods: binary antipodal sig 
naling and binary orthogonal signaling. We show that some of the most common binar 
modulation schemes, including binary pulse amplitude modulation (KAM), binar 
amplitude-shift keying (ASK), binary pulse position modulation (PP and bi 
frequency-shift keying (FSK), are special cases of these two modulation nfethods/ Wi 
assume that the information to be transmitted is a binary sequence that consists of zero 
and ones, and occurs at the bit rate R, bits/sec (bps). 


8.2.1 Binary Antipodal Signaling 


Binary antipodal signaling is the simplest digital modulation method. In isfpons th 
information bit 1 is represented by a pulse p(t) of duration T, and the information bi 
0 is represented by — p(t). Since one signal pulse is the negative of the other, this typ 
of signaling is called binary antipodal signaling and since each signal carries one bit o 
information, the bit interval T, is equal to signal duration T, i.e., T, = T. An exampl 
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s(t) aj s(t) 1 





Figure8.4 Binary PAM signals. 


of binary antipodal signaling is shown in Figure 8.4 where p(t) is a rectangular pulse 
of amplitude A. Because in this type of signaling binary information is transmitted by 
the amplitude of the pulse, this modulation scheme is also called binary pulse amplitude 
modulation. Pulses are transmitted at a bit rate Rẹ = 1/T, bits/sec, where T, is the bit 
interval. The pulse shape determines the spectral characteristics of the transmitted signal, 
as described in Chapter 10. 

In this particular example, the binary PAM signal waveforms are expressed as 


Sm(t) = Am8r(t), O<t<T,, m=1,2, (8.2.1) 


where A m takes one of two possible values (A for m = 1 and —A for m = 2), and gr (t) is 
a rectangular pulse of unit amplitude, as shown in Figure 8.5. The signal energy in each of 
the two waveforms is 


Th 
Em =f s2(t)dt, m=1,2 
0 


To 22 

=a? [ g(t) dt (8:2:2) 
0 
= A Tp. 


Hence, the two signal waveforms have equal energy, i.e., 6m = A?Tp, for m = 1, 2. Each 
signal waveform carries one bit of information. Therefore, we define the signal energy per 
bit of information as ,. Thus, we have A = ,/€,/Thp. 

The two signal waveforms in binary PAM have a very simple geometric representa- 
tion. The signal waveforms are expressed as 


Sm(t) = Sm (t), m=1,2, (8.2.3) 
gr (8) 
1 
0 T t Figure8.5 A rectangular pulse of unit amplitude and 
b 


duration 7,. 
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Figure 8.6 Unit energy basis function for binary PAM. 


- VB Ve, - VB; V3; 


' 
i 
s2 0 51 52 0 s1 








Figure 8.7 Geometric representation of binary PAM. 





where w(t) is the unit energy rectangular pulse shown in Figure 8.6, and s1 = /%; 
s2 = — 8p. We note that the binary PAM signal waveforms can be uniquely represente 
geometrically in one dimension (on the real line) as two vectors, and each has the ampli 
tude ./G,, as shown in Figure 8.7(a). For simplicity, we usually omit drawing the vector 
from the origin, and we simply display the two endpoints at v8, and —/€,, as shown i 
Figure 8.7(b). 


Example 8.2.1 


Consider the two antipodal signal waveforms shown in Figure 8.8. Show that these signa 
have exactly the same geometric representation as the two rectangular pulses in Figure 8.4. 








Solution Note that the two signal pulses have energy 6,. The waveforms of these signa 
may also be represented as 


Sm(t) = SmW(t), m= 1,2, \ 


sð s(t) | 























Figure 8.8 Binary antipodal signals in Example 8.2.1. 
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v(i) 


Figure 8.9 Unit energy basis function for the antipodal 
signals in Figure 8.8. 





where ¥(t) is the unit energy waveform shown in Figure 8.9, and sı = 65,52. = —/€p. 
Therefore, the two antipodal signal waveforms in Figure 8.8 have exactly the same geometric 
signal representation as those shown in Figure 8.4. a 


From this discussion, we conclude that any pair of antipodal signal waveforms can 
be represented geometrically as two vectors (two signal points) on the real line, where one 
vector is the negative of the other, as shown in Figure 8.7. The unit energy waveform, in 


general, is given by W(t) = Ter where ©, denotes the energy in p(t). 
P 





Binary Amplitude-Shift Keying. Binary ASK is a special case of binary antipo- 
dal signaling in which two baseband signals + p(t) are used to amplitude modulate a sinu- 
soidal carrier signal cos 27 fst, i.e., 


si(t) = p(t) cos2u fet, O<t< Th, 
(8.2.4) 
So(t) = — p(t) cos2uf.t, O<t < Tp. 


In the special case where p(t) is a rectangular pulse, the two signals can be expressed as 


26, 
silt) = ,/— cos2nf,t, O<t < Th, 
Ty 
26 
s(t) =- [= cos2n fet, O<t <Tp, 
= b 


where €, denotes the energy in sı (t) and s2(t), as shown in Problem 8.5. Plots of binary 
ASK signal waveforms for this case are shown in Figure 8.10. 

Geometric representation of binary ASKis similar to other binary antipodal schemes 
as shown in Figure 8.7, and the unit energy signal used for geometric representation of 
these signals is 


(8.2.5) 


vt) = [= cos 27 fet, O<t < Tp. (8.2.6) 
b 
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FIT, V26,/T, 


Figure 8.10 Binary ASK signal waveforms. 


The main difference between binary ASK and the binary antipodal scheme show 
in Figure 8.4 is in their spectral characteristics. The waveforms shown in Figure 8.4 ar 
baseband (lowpass) signals with their spectral content near f = 0, whereas the carrie 
modulated signals in Equation (8.2.4) are bandpass signals with spectrum near + f,. 


8.2.2 Binary Orthogonal Signaling 


Another type of pulse modulation that may be used to transmit a binary informatio 
sequence is binary orthogonal signaling. In binary orthogonal signaling, s;(¢) and s2(t 
have equal energy @, and are orthogonal, i.e., 


Ty 
f si(t) dt = 8p, 
0 
Tp ` 
f s(t) dt = 8p, l (8.2.7 
0 


To 
| sı(t)s2(t)dt = 0. 
0 


Since the two signals are already orthogonal, the task of finding an orthonormal bas: 
for representing them reduces to normalizing each signal, i.e., 


_ 81 (t) = 
nO =e | 
(8.2.8 


VE, 


Using this orthonormal basis, we can write s;(t) and s2(t) as 
s(t) = VE, W(t) + 0 W(t), 


{ (8.2. 
s2(t) = O Y1 (t) + V€ Walt) 
&- 


p(t) = 
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2 4VE, sh VE 





Figure 8.11 Geometric representation of 
(a) (b) binary orthogonal signal waveforms. 


resulting in 


sı = (V&,0), 
s2 = (0, V&), 


as shown in Figure 8.11(a) or as shown in Figure 8.11(b) as two signal points in two- 
dimensional space. We observe that the two signal vectors are perpendicular; hence, they 
are orthogonal, i.e., their dot product is equal to zero. 


(8.2.10) 


Binary Pulse Position Modulation. Binary pulse position modulation is an 
example of binary orthogonal signaling. In binary PPM, we employ two pulses that are 
different only in their location. Signal waveforms 5s) (t) and s2(t), which are shown in 
Figure 8.12, represent a binary PPM signaling scheme. Since these two signals are non- 
overlapping, they are orthogonal, i.e., 


Tp 
I sı(t)s2(t) dt = 0. (8.2.11) 
0 
As we may observe by inspection, the two signal waveforms have identical energies, i.e., 
Ty Tp 
B= [stoar= f stoat. (8.2.12) 
0 0 
51 (t) 52(2) 


V26,/Ts 





0 = Tyi 


Figure 8.12 Signal pulses in binary PPM (orthogonal signals). 
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y(t) p(t) 





Figure 8.13 Two orthonormal basis functions for binary PPM signals. 


In order to represent these two waveforms geometrically as vectors, we need tw 
orthonormal basis functions. These two waveforms, y(t) and w(t), are shown i 
Figure 8.13. Consequently, the signal waveforms sı (t) and s2(t) may be expressed as 


sı (£) = su Ya (t) + spr), 


2.1: 
s2(t) = s1 V1 (t) + sr¥r2(t), ak 
where we can easily observe that 
Ty 
Si =f sı(t) Y (t)dt = y 8p; 
0 
Tp 
sa= f si@vatedt =0; 
: (8:2.1. 


To 
ne Í s241 (t)dt = 0; 


Ti 
R Í " AOp Ddt = VE. 


The two signal waveforms are represented as two-dimensional vectors s; and s2 given t 
Equation (8.2.10) and shown in Figure 8.11. 


Binary Frequency-Shift Keying. Another example of binary orthogonal signe 
is binary frequency-shift keying. In binary FSK, the two signal waveforms are 


28 
sı(t) = £9 cos2nfit, 0<t< T, 
V 
[26 
s(t) = F008 a fat, 0<1t<T, r > 


with fı = kı/2Tp and f2 = k2/2T,, where kı and kz are distinct positive integers. It is cle 
that the frequency separation of the two sinusoidal signals is Af = |f2 — fil = m/2 
where m = |k2 — kı| is a positive integer. The signals are illustrated in Figure 8.14. It 
easy to verify that under these conditions,@; (t) and s2(t) have equal energy é, and tt 
they are orthogonal, i.e., (see Problem 8.6) I 


(8.2.1 
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V26,/T; V2%,/T, 
AA T, AA Tp 


Figure 8.14 Binary FSK signal waveforms. 


Tp Th 
f si(t)dt = f s3(t) dt = G, 
0 0 


and 
Tp 
f sı (t)s2(t)dt = 0. 
0 
The two basis waveforms for the binary FSK signals are 
5 
y(t) = ae cos2nfit, O<t<T, (8.2.16) 
VT, 
3 : 
y(t) = F cos2n fot, O<t < Tp. (8.2.17) 
\ 7, 
Hence, 
SE) =V Eep) and s(t) = V sylt). (8.2.18) 


Consequently, these binary FSK waveforms have the same geometric representation as 
shown in Figure 8.11 for binary orthogonal waveforms. 

The basic difference between binary FSK signals and the binary PPM signals is their 
spectral characteristics, which are shown in Figure 8.15. We observe that the frequency 
content of the PPM signals is centered around f = 0, whereas the binary FSK signals 


alen bl 


Figure 8.15 Spectral characteristics of binary PPM (left) and FSK (right) signals. 
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are bandpass signals. Hence, the PPM signals are baseband signals that are suitable fo 
transmission in baseband channels, whereas the FSK signals are bandpass signals that ar, 
suitable for transmission in bandpass channels. We will observe that, in spite of these differ 
ences in the spectral characteristics of the two signal types, their performance in channe] 
corrupted by additive white Gaussian noise (AWGN) is identical, simply because they hav 
identical geometric representations. 


Example 8.2.2 
Consider the two orthogonal signal waveforms shown in Figure 8.16. Show that these tw 
signal waveforms represent a binary orthogonal signaling scheme and have a geometric rey 
resentation similar to Figure 8.11. 


Solution By inspection, the two signal waveforms satisfy the orthogonality condition give 
by Equation (8.2.11), and they have energy @,. By using the orthonormal basis waveform 
Yı (t) and W(t) in Figure 8.13, the signal waveforms sj (t) and s, are expressed as 
























































51) = su Wi@ + salt) 
and 
‘Ti Si(t) = sY E) + swale), (8.2.1! 
| Al where 
To 
| Si =f sOy (dt = VE, /2; 
| 0 
To 
Si = Í si (Dy (t)dt = JS, /2; 
0 
l (8.2.2 
i To 
| sin = [sonar = JEP 
0 
i 
Tp 
! Sy =f 53(t)Wo(t)dt = —/S, /2. 
0 
it 
sO 
VEIT, 
>t 
0 
Figure 8.16 Two orthogonal signal waveforms. 
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si 


Figure 8.17 Signal vectors for the binary orthogonal waveforms shown 


S f 
2 in Figure 8.16. 


The vectors s| = (VE. V%,/2) and s} = (v82 -v8,/2) areshown in Figure 8.17. 
We observe that s} and s, are perpendicular (orthogonal vectors) and are simply a phase- 
rotated version of the orthogonal vectors shown in Figure 8.11 (a). 

Note that because sı (t) and s2(t) are orthogonal, we could simply normalize them and 
use the result as an orthonormal basis. Using this method, we would obtain vector representa- 
tions given in Equation (8.2.10) and geometric representation as in Figure 8.11. E 





8.3 OPTIMUM RECEIVER FOR BINARY MODULATED SIGNALS IN ADDITIVE WHITE GAUSSIAN 
NOISE 


In this section, we describe the signal processing operations performed at the receiver to 
recover the transmitted information. We begin by describing the channel that corrupts the 
transmitted signal by the addition of noise. l 


Additive White Gaussian Noise Channel. The communication channel is as- 
sumed to corrupt the transmitted signal by the addition of white Gaussian noise, as shown 
in Figure 8.18. Thus, the received signal in a signal interval of duration T, may be expressed 


as 
r(t) = Sp(t) + n(t), m= 1,2, (8.3.1) 
T id Channel 
ransmıtte 
i Received signal 
par (+) r(t) = Sp(t) + n(t) 


Noise 


n(t) 


Figure 8.18 Model for the received signal passed through an AWGN channel. 
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r(t) Signal Dees Decision on 
—— > ; 
Received demodulator transmitted signal 
signal 


Receiver 


Figure 8.19 Receiver for digitally modulated signals. 


where n(t) denotes the sample function of the AWGN process with the power spectra 
density Sa (f) = No/2 W/Hz. Based on the observation of r (t) over the signal interval, w 
wish to design a receiver that is optimum in the sense that it minimizes the probability o 
making an error. To be specific, we focus on the processing of the received signal r(t) i 
the interval 0 < t < Tp. 

It is convenient to subdivide the receiver into two parts, the signal demodulator an: 
the detector, as shown in Figure 8.19. In Section 8.4.1, we will show that such a subdivisio 
does not affect the optimality of the overall system. The signal demodulator’s function is t 
convert the received signal waveform r (t) into a vector y, whose dimension is equal to th 
dimension of the transmitted signal waveforms. The detector’s function is to decide whic 
of the two possible signal waveforms was transmitted; this decision is based on observatio 
of the vector y. 

Two realizations of the signal demodulator are described in Section 8.3.1 and Sec 
tion 8.3.2. The first is based on the use of signal correlators. The second is based on the us 
of matched filters. The optimum detector that follows the signal demodulator is designe: 
to minimize the probability of error. 





8.3.1 Correlation-Type Demodulator 


In this section, we describe the processing of the received signal by a correlation-typ 
demodulator for binary antipodal signals (binary PAM) and binary orthogonal signal 
(binary PPM). We begin with binary antipodal signals. 


Binary Antipodal Signals. Let us consider the binary antipodal signal generall 
represented as 


Sm(t) = SmW (t), m= 1, 2, ‘ (8.3.2 
where y(t) is the unit energy rectangular pulse w(t) = ie. For the special case wher 


P 
rectangular pulses are used, Y(t) is shown in Figure 8.6. We have seen that in binar 
antipodal signaling sı = v6» and s2 = —./%,. Therefore, the received signal is 


r(t)=smp()+n(t), O<t <T,, m=1,2. bss: 


In acorrelation-type demodulator, the received signal r (t) is multiplied by the sign: 
waveform y(t) and the product is integrated over the interval 0 < t£ < Tp, as illustrate 





a a a a Ra 
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r(t) : yO) / 
Received ` K ) á 
signal e f 


Sample at 


t=T, Figure 8.20 Cross correlator for binary 
y(t) antipodal signals. 


in Figure 8.20. We say that r (t) is cross correlated with w(t). Mathematically, this cross- 
correlation operation produces the output 


y(t) =Í r(t)w(t) dt 
=f [smy (T) + n(t)]W(t) dt (8.3.4) 


ot t 
= Sm [ w(t) dt + | n(tT)Y(T)dT. 
0 0 
We sample the output of the correlator at t = Tp. Thus, we obtain 


y(Th) = 5m +n, (8.3.5) 


where n is the additive noise term defined as 


Tp 
n =f W(t)n(t) dt. (8.3.6) 
0 


Since n(t) is a sample function of a white Gaussian noise process, the noise term n is a 
Gaussian random variable with zero mean and with variance 


Tb Tb 
a? = Eê) = | f E(nt)n(t) (Wa) didt 
0 0 
Ti T; 
Š f i ' Mat — YOY) ata (8.3.7) 
0 


0 
No [> 45 No 
= — tdt = —, 
>f VO) 5 


tal 


| where we have used Equation (5.3.3) for the autocorrelation funċtion of white noise. There- 


fore, for a given signal transmission (given Sm), the output of the correlator (y = y(Tẹ)) is 
a Gaussian random variable with mean s,, and variance No/2, i.e., 


1 2 z 
= — em MM m= 1,2. 8.3.8 
fOlSm) JaN, ( ) 


These two conditional probability density functions (PDFs) are illustrated in Figure 8.21. 
This correlator output is fed to the detector, which decides whether the transmitted bit is a 
zero or a one, as described in Section 8.3.3. 
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FOISm) 





Vé VE, 


Figure 8.21 The conditional probability density functions of the correlator output for binary antipodal 
signaling. 


Example 8.3.1 


Sketch the noise-free output of the correlator for the rectangular pulse w(t), as shown 
Figure 8.6, when s;(¢) and s2(t) are transmitted. 


Solution With z(t) = 0, the signal waveform at the output of the correlator is 
t t 
y= [sade = sq | Podr 
0 0 


The graphs of y(t) for sı = Js, and s2 = —,/%, are shown in Figure 8.22. Since 
pulse y(t) is constant over the integration interval 0 < t < Th, the correlator is just a sim 
integrator. We observe that the maximum signal at the output of the correlator occurs at t = 
We also observe that the correlator must be reset to zero at the end of each bit interval Tp, 
that it can be used in the demodulation of the received signal in the next signal interval. Si 
an integrator is called an integrate-and-dump filter 


Example 8.3.2 
If the binary antipodal signals illustrated in Figure 8.8 are used for the transmission of inf 
mation, demonstrate that the output of the correlation-type demodulator is exactly the same 
that given for the previously discussed rectangular pulse signals. 


Solution In this case, y(t) is the signal waveform shown in Figure 8.9, and the recer 
signal is 


r(t) = SmY (t) +n(t). 





Figure 8.22 Noise-free cross-correlator outputs when s ,(t) and s2(t) are transmitted. 
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The cross-correlator output is 


y(t) s r(t)y(t) dt 


Sa | y2(t)de + | n(c)w(t) dt; 
0 0 


att = Tp, we have 
y(Th) = Sm +n. 


The signal component s,, in the cross-correlator output is the same as that obtained for the rect- 
angular pulse, and the noise term n has exactly the same mean (zero) and variance 
o 2 = No/ 2: z 


Example 8.3.3 
The correlation-type demodulator can be implemented in discrete time by sampling the recei- 
ved signal waveform and cross correlating it with the sampled version of the possible trans- 
mitted signals. Let us consider antipodal signaling based on the rectangular pulse shown in 
Figure 8.5. The received signal waveform r (t) and the transmitted pulse s(t) are sampled at a 
rate of F, = 20/T, (the sampling interval is Ts = T,/20). The correlation of r(t) with s(t) 
performed numerically yields 


k 
1 
WRT) = 5 O TOTISAT:)  k=1,2,...,20 


nel 


Compute and plot y(kT;) for 1 < k < 20 when the transmitted signal is s(t) and —s (t), 
where s(t) is the rectangular pulse shown in Figure 8.5 and r(t) is noise free. Compare the 
plots with the continuous-time correlator outputs shown in Figure 8.22. 


Solution Figure 8.23 illustrates the noise-free outputs of the discrete-time correlator for the 
two antipodal signals. They are similar to the outputs of the continuous-time correlator. m 


Binary OrthogonalSignals. Let us consider the two orthogonal signals given by 
Equation (8.2.13) and illustrated in Figure 8.12, where y (t) and y(t) are the orthogonal 
basis functions as shown in Figure 8.13, and sı = (/€;, 0) and s2 = (0, v8») are the 
signal vectors. These two signals are used to transmit a binary information sequence, which 
consists of zeros and ones and occurs at a rate R, = 1/T; bits/sec. In the presence of 
AWGN, the received signal has the form 


r(t) = Sm(t)+n(t), O<t<T,, m=1,2. (8.3.9) 
Since the two possible transmitted signals are two-dimensional, the received signal 


r(t) is cross correlated with each of the two basis signal waveforms y(t) and y(t), as 
shown in Figure 8.24. The correlator output waveforms are 


t 
Ym(t) = J r(t)Wm(t) dt, m=1,2, (8.3.10) 
0 
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y4) 


Figure 8.23 The correlator output in 









Example 8.3.3. 

t 

j Y ) yi) eens to detector 

Sample at 
t= Te 
Received 
signal p(t) 

t 

f ( ) yo) EERE to detector 


Figure 8.24 Correlation-type demodulator for binary orthogonal signals. 
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which, when sampled at t = T, result in the outputs 


Tp 
RIDE f EET E 12, (8.3.11) 
0 


Now, suppose the transmitted signal is sı (t) = sy (t), so thatr(t) = sywi(t) + 
n(t). The output of the first correlator is 


Tb 
y = [s11 Y(T) +n(t)]yui(t) dt 
0 (8.3.12) 


= su +n = Ve +n, 


where 5}; = sz, is the signal component and n; is the noise component, defined as 


Th 
ni =f n(t)yı(t)dT. (8.3.13) 
0 
The output of the second correlator is 
T 
n= | ‘Isuth (t) +n(t)]W2(t) dt (8.3.14) 
Tp Tp 
=su f nawd + | n(t)Wo(t) dt (8.3.15) 
0 
Ty 
=f n(t)Wo(t) dt = n2. (8.3.16) 
0 


The output of the second correlator only includes the noise component n2, because y (t) 
and y(t) are orthogonal. Consequently, the received signal vector is 


y = (1, y2) (8.3.17) 


= (v€ + nna) l (8.3.18) 


It is easy to verify that when the signal s2(t) = s2242 (t) is transmitted, the outputs 
of the two correlators are yı = nı and y2 = S22 + n2 = Vp + n2. Hence, the received 
signal vector is 


y = (y1, y2) (8.3.19) 


= (m, VEn + n2) (8.3.20) 


The vector y at the output of the cross correlators is fed to the detector, which decides 
whether the received signal vector corresponds to the transmission of a one or a zero. 

The statistical characteristics of the observed signal vector y are easily determined. 
Since n(t) is a sample function of a white Gaussian noise process, the noise terms nı and 
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n are zero-mean Gaussian random variables with variance ae = No/2. Furthermore, th 
correlation between n and n2 is 


Tp Tp 
E(njn2) -=f Í E(n(t)n(t)) vi ()W2(t) dt dt 


To pt; 
=| f ' Mst - DW) dear (8.3.2) 


Il 


No f% 
z Vi(t)wa(t) dt = 0. 


0 


Therefore, nı and n2 are uncorrelated; since they are Gaussian, nı and nz are statist 
cally independent. Consequently, when the transmitted signal is sı (t), the conditional jou 
probability density function of the correlator output components (y1, y2) is 


1 z -(n- Sp No ZIN, 
fO y2ls1) = (x) € ea, (8.3.2: 
0 


When s(t) is transmitted, the conditional joint PDF of the correlator output componen 
(Yı; y2) is ; ; 
Je em e-(22=4 6) No (8.3.2: 


Since the noise components nı and nz are statistically independent, we observe th 
the joint PDFs of (1, y2), which are given by Equations (8.3.22) and (8.3.23), factor in 
a product of marginal probability density functions, i.e., 


FOr, Y2lSm) = f OilSm) fO2lSm), m= 1,2. (8.3.2 


For example, Figure 8.25 shows the PDFs f(y,|s,) and f (y2|s1) when sı (t) is transmitte 


Example 8.3.4 
Consider the two orthogonal signals shown in Figure 8.12. Assuming that the transmitt 
signal waveform is s2(t) = VEsvr (t), in the absence of additive noise, sketch the out 
waveforms of the two correlators shown in Figure 8.24. The other signal waveform is s; (t) 
VE; (t), where y(t) and y(t) are the basis functions shown in Figure 8.13. 





1 
fi, y2|S2) = (ax 





fols) fOr! sy) 


Yi: Y2 


0 VE; 


Figure 8.25 The conditional probability density functions of the outputs (yı, y2) from the cross correlators ı 
two orthogonal signals. 
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0 ü o T, T, 


Figure 8.26 Correlator output signal waveforms when s(t) is the transmitted signal. 


Solution When s(t) is transmitted in the absence of noise, the outputs of the two correlators 
shown in Figure 8.26 are 


y(t) = [ S(T) Y(t) dt = BS, [ vo(t) W(t) dt, 


y(t) = [ S(T) Y(T) dt = Ey i V3 (t) dr. 


The graphs of the correlator outputs are illustrated in Figure 8.26. Note that the noise-free 
output of the first correlator is zero for 0 < t < T, because y(t) and W(t) are nonoverlapping 
orthogonal waveforms. | 


Example 8.3.5 


The correlation-type demodulator for binary orthogonal signaling scheme in Figure 8.24 can 
be implemented in discrete time by sampling the received signal waveform and cross correlat- 
ing it with the sampled version of the possible transmitted signals. Let us consider orthogonal 
signaling based on the rectangular pulses shown in Figure 8.12, so that 


r(t) = s:(t) +n), i=1,2. 


The received signal waveform r(t) and the transmitted pulse sı (t) and s2(t) are sampled at a 
rate of F, = 20/T,. The discrete-time correlator outputs are (T, = T,/20) 


k 
1 
nET) = 55 2O r(nTsi (aT), 1 <k <20, 


k 
1 
y2(kTs) = 55 LNaT (eT), 1<k<20. 


Compute and plot y,(kT,) and y2(kT,) for 1 < k < 20 when s,(t) is transmitted and 
when and s»(t) is transmitted and r(t) is noise free. Compare the plots with the continuous- 
time correlator outputs shown in Figure 8.26. 


Solution Figure 8.27 illustrates the noise-free outputs of the discrete-time correlator for the 
two antipodal signals. They are similar to the outputs of the continuous-time correlator. a 
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jy (kT) | y2(kTs) 











—|-2-0-0-0-0-0-0-0-0-0-0-0-0-0-0-6-0-0-0-0-———> kT, 
Tp 
| y(kTs) | y2(kTs) 

Eo 742222222 =| 

—pooeoo-o-o-ooo-oo-oo-eoeoem T, k 
Tp Tp T, 
2 

Figure 8.27 The correlator outputs in Example 8.3.5 when s(t) is transmitted (top) and when s2(t) is 
transmitted (bottom). 





Example 8.3.6 
If the binary orthogonal signals illustrated in Figure 8.16 are used for the transmission « 
binary information, determine the outputs of the two correlators when the signal wavefor 
5; (£) is transmitted. 








Solution The two orthogonal signals in Figure 8.16 are expressed as a linear combination ı 
the two nonoverlapping basis functions y; (t)and W2(t), as shown in Figure 8.13. Thus, 

















Wi s(t) = silt) + syravro(t) = yY o/21 (1) + y Sn /2¥n(0) 
H | and 
i 50) = SV) + saat) = y 8/2710) — y Bs/2al0). 


When sj (t) is transmitted, the received signalis r(t) = s}(t)+n(t). Thetwo correlator outpt 
are 

















Ty 
Ym =f r(t)Wm(t) dt, m= 1,2. 
0 


By substituting for s; (t) in the preceding integral and using the orthogonality property of yy ( 


and y(t), we obtain 
yı =y Ss/2+m 


y = yĉ/2 +m, 





and 
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where the noise components are defined as 


Ty 
= f iode maha 
0 


As demonstrated, n; and nz are zero-mean uncorrelated Gaussian random variables, each hav- 
ing a variance 0? = No/2. Therefore, the joint probability density function of y; and y2, 
conditioned on s; (t) being transmitted, is 


2 2 
F (v1, yalsi) = (Fx) Pen à g n) a (8.3.25) 


J No 


a 
8.3.2 Matched-Filter-Type Demodulator 


Instead of using cross correlation to perform the demodulation, we may use a filter-type 
demodulator, as described in the next paragraph. As in the preceding section, we describe 
this demodulator for binary antipodal and binary orthogonal signals. 


Binary Antipodal Signals. As we have observed, in the case of binary antipodal 
signals, the received signal is 


r(t)=Smp(t)t+n(t), O<t<T,, m=1,2, (8.3.26) 


where w(t) is a unit energy pulse of duration T}. Suppose we pass the received signal r (t) 
through a linear time-invariant filter with impulse response 


h(t) = Y(T —t), 0<t< T. (8.3.27) 
The filter output is 
yt) = f r(t)h(t — t) dt. (8.3.28) 
0 


If we sample the output of the filter att = Tp, we obtain 


Tp 
yT) = Í r(t)h(T, — t) dt. 


But A(T, — t) = y (t). Therefore, . 
T, ž X 
y(Ts) = Í Tn 


T T, 
= Sm f “wate dee I "aatar (8.3.29) 
E 0 


= Sm tn. 
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s(t) h(t) =s(T- ù) 
A 





0 T t 


(b) Impulse response Figure 8.28 Signal s(t) and filter 
of filter matched to s(t) matched to s(t). 


(a) Signal s(t) 


Hence, the output of the filter at t = Tp is exactly the same as the output obtained with 
cross correlator. 

A filter whose impulse response h(t) = s(T — t), where s(t) is assumed to b 
confined to the time interval 0 < t < T, is called the matched filter to the signal s(t 
An example of a signal and its matched filter are shown in Figure 8.28. The response c 
h(t) = s(T — t) tothe signal s(t) is 


y(t) = Í s(t)s(T—t+rt)dī, (8.3.3( 
0 


which is basically the time-autocorrelation function of the signal s(t), as shown i 
Equation (2.3.37). Figure 8.29 illustrates y(t) for the triangular signal pulse shown i 
Figure 8.28. We note that the autocorrelation function y(t) is an even function of t, whic 
attains a peak at t = T. The peak value y(T) is equal to the energy of the signal s(t). 


Example 8.3.7 
For the signal s(t) shown in Figure 8.28(a), implement the matched filter in discrete-time ar 
compute the matched filter output when the input is the sampled version of s(t). Select tt 
sampling rate F, = 20/T; (sampling interval T, = T,/20). Plot the matched filter output. 


Solution The matched filter impulse response is 


A(t) = s(T, — t) 


Figure 8.29 The matched filter 
t output is the autocorrelation function 


s(t). 
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| y(kTs) 


408 Pog kT. 


Figure 8.30 The matched filter output in Example 8.3.7. 


and the continuous-time output is the convolution of h(t) with s(t), i.e., 


y(t) = f h(t)s(t — t) dt 
0 


Thus, the discrete-time output of the matched filter is 


k 
1 
IET) = 55 Y"h(mT,)s(kT, — mT) 
m=1 
1 k 
= 55 28207, — mT,)s(kT, — mT) 


m=) 


The plot of y(kT;) is shown in Figure 8.30. E 


Binary OrthogonalSignals. As we have observed inthe previous section, binary 
orthogonal signals are two-dimensional signals; hence, two correlators are needed to per- 
form the demodulation. In place of the correlators, two linear time-invariant filters may be 
employed. To be specific, consider the received signal 


r(t)=Sn(t) +n), O<t<T,, m=1,2, (8.3.31) 


where S(t) and m = 1, 2 are the two orthogonal waveforms given by Equation (8.2.13) 
and illustrated in Figure 8.12, and Ym(t) and m = 1, 2 are the two orthogonal basis func- 
tions shown in Figure 8.13. The impulse responses of the two filters matched to yy (t) and 
W(t) are defined as 


h@=wilh—-t), 0<t<f, 
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and 
hot) = WG - t), O<t <I. (8.3.32 


When the received signal r(t) is passed through the two filters, their outputs are 
t 
Y(t) = f r(t)hm(t —t) dt, m= 1,2. (8.3.33 
0 


If we sample the outputs of these filters at t = Tp, we obtain 


Tp 
poe Í Oad 
(8.3.34 


Th 
=| r(t)Wn(t) dt, m=1,2. 
0 


By comparing the outputs of the two matched filters att = T, with the outputs obtaine: 
from the cross correlators given by Equation (8.3.11), we observe that the outputs are iden 
tical. Therefore, the correlation-type demodulator and the matched-filter-type demodulato 
yield identical outputs at t = T. 


Properties of the Matched Filter. A matched filter has some interesting prope1 
ties. Let us prove the most important property, which may be stated as follows: If a signa 
S(t) is corrupted by AWGN, the filter with the impulse response matched to s(t) maximize 
the output signal-to-noise ratio (SNR). 

To prove this property, let us assume that the received signal r(t) consists of th 
signal s(t) and AWGN n(t), which has zero mean and a power spectral density S,(f) = 
No/2 W/Hz. Suppose the signal r(t) is passed through a filter with the impulse respons 
h(t), 0 < t < T, and its output is sampled at time t = T. The filter response to the signe 
and noise components is 


y(t) = r(t)h(t — t) dt 
0 


(8.3.35 
t t 
= | s(t)h(t —t)dt+ | n(t)h(t — t) dt. 
0 0 
At the sampling instant t = T, the signal and noise components are 
T T 
y(T) = i s(t)h(T —t)dt+ f n(t)h(T —t)dt 
0 0 (8.3.3¢ 


= ys(T) + y, (T), 


where ys(T) represents the signal component and y,(T) represents the noise compt 
nent. The problem is to select the filter impulse response that maximizes the output SNF 
defined as 
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(=) = xT) on 
Njo ET) S 


The denominator in Equation (8.3.37) is simply the variance of the noise term at the 
output of the filter. Let us evaluate E ( y2(T)). We have 


T T 
E(y2(T)) = ie f E(n(t)n(t)) A(T — t)h(T —t) dt dt 


No T T 
= all i 5(t— t)h(T — t)A(T — 1) dt dt (8.3.38) 
0 0 


N T 
= Sf n2(T — 1) dt. 
2 Jo 


Note that the variance depends on the power spectral density of the noise and the energy in 
the impulse response h(t). 

By substituting for y,(T) and E (y2 (1) i in Equation (8.3.37), we obtain the expres- 
sion for the output SNR as 


2 2 
( ‘ | JE s@acr — )dt| [ [2 h@s(T - t)dt| PORT 
a) — MERT-)d Xf eT —a)at i 


Since the denominator of the SNR depends on the energy in h(¢), the maximum output 
SNR over h(t) is obtained by maximizing the numerator of (S/N )o subject to the constraint 
that the denominator is held constant. The maximization of the numerator is most easily 
performed by using the Cauchy—Schwartz inequality, which generally states that if g; (t) 
and g>(t) are finite energy signals, then 


fore) l 2 oo oo 
| l gear < / gi(t) dt | g3(t) dt, (8.3.40) 


where equality holds when 8; (t) = Cg2(t) for any arbitrary constant C. If we set 81 (t) = 
A(t) and g2(t) = s(T —t), it is clearthat the (S/N) o is maximized when h(t) = Cs(T —t), 
i.e., h(t) is matched to the signal s(t). The scale factor C? drops out of the expression for 
(S/N)o, since it appears in both the numerator and the denominator. 

The output (maximum) SNR obtained with the matched filter is 


T 
(=) => f s?(t)dt 
Oo 


(8.3.41) 
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where ©, is the energy of the signal s(t). Note thatthe output SNR from the matched filter 
depends on the energy of the waveform s(t), but not on the detailed characteristics of s(t), 
This is another interesting property of the matched filter. 


Frequency-Domain Interpretation of the Matched Filter. The matched filter 
has an interesting frequency-domain interpretation. Since h(t) = s(T — t), the Fourier 
transform of this relationship is 


T 
H(f) =f s(T —t) e~J27Ft dt 


= f° s(t) rac e— stn fT l (8.3.42) 
0 


= PNT, 


We observe that the matched filter has a frequency response that is the complex conjugate 
of the transmitted signal spectrum multiplied by the phase factor e~/2"/7, which represents 
the sampling delay of T. In other words, |H(/)| = |S(/)|, so that the magnitude response 
of the matched filter is identical to the transmitted signal spectrum. On the other hand, the 
phase of H (f) is the negative of the phase of S(f), shifted by a linear function of T. 

Now, if the signal s(t), with spectrum S( f), is passed through the matched filter, the 
filter output has a spectrum Y (f) = |S(f)|?e7/2"4/7. Hence, the output waveform is 


ys(t) = f E Y (f) eif df 


—0o 
(8.3.43: 
[0] 
= J ISCA) eiT eif! df, 
— 0 
By sampling the output of the matched filter at t = T, we obtain 
o0 T 
y= J IS(AyPaf = f (f) dt =%,, (8.3.44 
—0o 0 
where the last step follows from Parseval’s relation. 
The noise of the output of the matched filter has a power spectral density 
So(f) =|H(f)PNo/2. (8.3.45 
Hence, the total noise power at the output of the matched filter is 
[oe 
P, = f Paf 
—00 
(8.3.46 


_ [> No 2 ->F 2gp — esNo 
=| Bupa- Ff OPa ==. 


ioe) 
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The output SNR is simply the ratio of the signal power 
P, = y? (T) (8.3.47) 


to the noise power P,,. Hence, 


O) 28m ee 
o Pn  €sNo/2 No’ i 





which agrees with the result given by Equation (8.3.41). 


Example 8.3.8 
Consider the binary orthogonal PPM signals, which are shown in Figure 8.12, for transmitting 
information over an AWGN channel. The noise is assumed to have zero mean and power 
spectral density No/2. Determine the impulse response of the matched filter demodulators, 
and the output waveforms of the matched filter demodulators when the transmitted signal is 


S1 (t). 


Solution The binary PPM signals have dimension N = 2. Hence, two basis functions are 
needed to represent the signals. From Figure 8.13, we choose y(t) and w(t) as 


(aes 
Witt) = 4) % 2 


0, otherwise 
2 T 
2, << 
y(t) = T 2 
0, otherwise 


These waveforms are illustrated in Figure 8.31(a). The impulse responses of the two matched 


filters are 
2 T; 
7 pstsh 
hit) = WT, — t) = $ 
0, otherwise 
and 


JZ 0 <t <T,/2 
hot) = WT, - t) = Th 


0, otherwise 


and are illustrated in Figure 8.31(b). 

If sı (t) is transmitted, the (noise-free) responses of the two matched filters are as shown 
in Figure 8.31(c). Since y; (t) and y2(t) are sampled at t = T,, we observe that ys (Tp) = Jb, 
and y2,(T,) = 0. Hence, the received vector formed from the two matched filter outputs at the 
sampling instant £ = T, is 


y=(1, 92) = (VE +n, m) , (8.3.49) 
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VORI plt) į 
2 Z 
o Tp T, 3T, t 0 A 
2 z 2 5 
(a) 
h(t) = W(T — t) } h(t) = YAT- À 4 
n 
aoe 
“0 Tp T, 3T, t 
2 2 
(b) 
yis(t) Yos(t) 4 
Ve, VG, |- 
ae IAS L 
0 Tp Tp 3T, t 0 Tr Tp 3T, t 
2: z 5 
(c) 


Figure 8.31 Basis functions and matched filter responses for Example 8.3.8. 


where n; = Yın (Tp) and n2 = y2,(T;) are the noise components at the outputs of the matche 
filters, given by 


Tb 
Yen (To) =f n(t)We(t)dt, k=1,2. (8.3.5 
0 


Clearly, E (ng) = E(yen(T,)) = 0. Their variance is 


Tp Tp 
o? =E(y;,(Ts)) = Í f E(n(t)n(t)) W(t) W(t) dt dt 
Tp Tı 
= = f Í ’ sa-l) dt dt (8.3.5 
0 0 


No To 2 No 
= =f wy(tdt = ay 


Observe that for the first matched filter, 


N 





No/2 No’ 


G (E) _ 28, 
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which agrees with our previous result, since Ê is the signal energy per information bit. Also, 
note that the outputs of the two matched filters corresponding to the transmitted signal s, (t) 


are Or, 92) = (m, VE +m). a 
8.3.3 The Performance of the Optimum Detector for Binary Signals 


In this section, we describe the optimum decision rule, which is employed by the detector 
to make decisions based on the output from the demodulator. For this development, we 
assume that the signals received in successive signal intervals are statistically independent, 
so the detector only needs to consider its input in a given bit interval when making a 
decision on the transmitted signal in that bit interval. 


Binary Antipodal Signals. As we have observed, the output of the demodulator 
in any signal (bit) interval is 


yY=5m +n, m = 1,2, (8.3.52) 


where Sm = +4/€, and n is a zero-mean Gaussian random variable with variance oa = 
No/2. The conditional probability density functions p(y|Sm), m = 1,2, are given by 
Equation (8.3.8) and illustrated in Figure 8.21. 

Since the input to the detector is a scalar, it is apparent that the detector compares 
y with a threshold œ, determines whether y > œ, and declares that the signal sı (t) was 
transmitted. Otherwise, it declares that s2(t) was transmitted. The optimality of this scheme 
will be shown in Section 8.4.1. Later in this chapter, we demonstrate that this decision rule 
maximizes the probability of making a correct decision or, equivalently, that it minimizes 
the probability of error. 

For the binary antipodal signals, the average probability of error as a function of the 
threshold a is 


Pala) = Pé) | Foiddy + PO) | Fylsa) dy, (8.3.53) 


where P (s1) and P (s2) are the a priori probabilities of the two possible transmitted signals. 
Let us determine the value of the threshold œ, say a*, that minimizes the average probabil- 
ity of error. Differentiating P2() with respect to œ and setting the derivative to zero, we 
obtain 


P(s;) f(@ls1) — P(s2)f(@ls2) =0, - 


or equivalently, 
falsi) _ Ps) 
f(a@|s2) Ps) 


Substituting the conditional probability density functions given by Equation (8.3.8) into 








(8.3.54) 


Equation (8.3.54) with sı = vp and s2 = ~y p, we have 
2 2 
e (VE) mo plet) io _ Pls2) (8.3.55) 


T Pe)’ 
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or equivalently, 


eta EBIN — P26) 
P (sı) 


Clearly, the optimum value of the threshold is 
at — M In P(52) 
4 Vv E b P (s 1) i 


We observe that if P (s1) > P (s2), then a* < 0, andif P (s2) > P (sı), thena* > 0 
In practice, the two possible signals are usually equally probable, i.e., the a priori probabil. 
ities P (s1) = P(s2) = 1/2. Here, Equation (8.3.56) yields the threshold œ* = 0. For thi: 
case, the average probability of error is 








(8.3.56 


0 fore) 
P=} folndy+4 Í FOvlsa)dy 
(8.3.57 


0 : | 
1 Hon ; 
= J f(ylsı)dy = x] y V8) Mo gy, 
a V1No J- 
By making a simple change in variable, specifically x = (y — SE) //No/?, the integra 
may be expressed as v ) v 


—JS%Es/No j 
e* 2 dx 


A) 


where Q(x) is the area under the tail of the normal (Gaussian) probability density function 


defined as 
ome [ era l J S eR dy (8.3.59 
x) = —— e u = —— 5 3. 
V 2T J- y 27 Jx 


For more properties of Q(x), see the discussion following Equation (5.1.7). We observ 
that the probability of error depends only on the SNR 2@,/No and not on the detaile, 
characteristics of the pulse waveforms. Also, Equations (5.1.8)—(5.1.10) clearly show tha 
the error probability tends to zero exponentially as SNR increases. 


(8.3.58 


Binary Orthogonal Signals. Inthe case of binary orthogonal signals, the outpu 
of the demodulator is the two-dimensional vector y = (yı, y2), where yı and yz are th 
outputs of the two cross correlators or the two matched filters. Recall that if the signal sı (t 
is transmitted, the demodulator outputs are 


= V6 +n 


y2 = M2, 


and 
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where the noise components nı and nz are statistically independent, zero-mean Gaussian 
variables with variance o? = No/2. For the important case in which the two orthogonal 
signals are equally probable, i.e., P(s1) = P(s2) = 1/2, the detector that minimizes the 
average probability of error simply compares yı with y2, If yı > y2, the detector declares 
that sı (t) was transmitted. Otherwise, it declares that s2(t) was transmitted. The optimality 
of this scheme follows from the discussion in Section 8.4.1. Based on this decision rule, 
assuming that s;(¢) was transmitted, the probability of error is simply the probability that 
yı — y2 < 0. Since yı and y, are Gaussian with equal variance o = No/2 and statistically 
independent, the difference 


Z=y)1— y2 
=V + aei (8.3.60) 


is also a Gaussian random variable with mean ./%, and variance o? = No. Consequently, 
the probability density function of z is 





ee ‘amo (8.3.61) 
and the average probability of error is 
0 
P2 = P@ <0)= | f(z) dz 
Ki (8.3.62) 


==]. OM nazo Sp 
— ASIn —00 a No f 


When we compare the average probability of error of binary antipodal signals given 
by Equation (8.3.58) to that of binary orthogonal signals, we observe that, for the same 
error probability P2, the binary antipodal signals require a factor of two (3 dB) less signal 
energy than orthogonal signals. The graphs of P, for these two signal types are shown in 
Figure 8.32. 


Performance of General Binary Equiprobable Signaling. Our derivation of 
the performance of binary antipodal and binary orthogonal signaling, for equiprobable 
messages, shows that 


26 
P= Q a for binary antipodal, (8.3.63) 
0 
Ke 
P =Q A for binary orthogonal, (8.3.64) 


where @, is the energy per bit and No/2 is the noise power spectral density. 
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Figure 8.32 Probability of error for binary signals. 


We make two interesting observations about the form of Pz. First, we note that the 
probability of error depends only on the ratio @,/No and not on any other detailed charac- 
teristics of the signals and the noise. Second, we note that 2€,/Np is also the output SNR 
from the matched filter (and correlation-type) demodulator. The ratio €,/No is usually 
called the signal-to-noise ratio or SNR/bit. 

We also observe that in both cases the probability of error may be expressed in terms 
of the distance between the two signals sı and s2. From Figure 8.7, we observe that in the 
case of binary antipodal signaling, the two signals are separated by the distance diz = 
2/ Sp. Substituting 6, = d?,/4 in Equation (8.3.63), we obtain 


2 
dh 


P =Q No 


(8.3.65) 
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Similarly, for binary orthogonal signaling, from Figure 8.11 we have diz = /2%, and 
substituting €, = d?,/2 in Equation (8.3.64) results in Equation (8.3.65). 

Equation (8.3.65) illustrates the dependence of the error probability on the Euclidean 
distance between the two signal points. Note that in general the Euclidean distance between 
the two signal points can be expressed directly in terms of the signal waveforms as 


diy = f (si (t) — s2(t))? dt. (8.3.66) 


It can be shown that Equation (8.3.65) can be used for computing the error proba- 
bility of any binary communication system with two equiprobable messages corrupted by 
AWGN. 

Example 8.3.9 

Signal waveforms s)(t) and s(t) shown in Figure 8.33 are used to transmit equiprobable 

binary data over an AWGN channel. Determine the bit-error probability of this system. 


Solution We can write 
2 O0<t<T,/2 


(= 41 7,/2<t<T, and 52(t) = | 
0 otherwise 


2t/T, 0<t<Tp 
0 otherwise ` 


Therefore, 


Te 
dh = I 6) — s(t)? dt 


Tp/2 Tp 
= Í (2 —2t/T,)* dt +f (1 — 2t/T,}? dt 
0 


To/2 
T,/2 Tı 
-| Te (2 zy bf +| Te (1 3 á 
6 To á 6 Tp e 
= 3h. 


$ s6) 





T2 Te 


Figure 8.33 Binary waveforms in Example 8.3.9. 
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Substituting into Equation (8.3.65) yields 


8.4 M-ARY DIGITAL MODULATION 


In Sections 8.2 and 8.3, we described the transmission of one bit at a time by employing 
two signal waveforms that are either antipodal or orthogonal. In this section, we consider 
the simultaneous transmission of multiple bits by using more than two signal waveforms. 
In this case, the binary information sequence is subdivided into blocks of k bits, called 
symbols, and each block (or symbol) is represented by one of M = 2* signal waveforms, 
each of duration T. Thistype of modulation is called M-ary modulation. Here we define the 
signaling (symbol) rate, Rs, as the number of signals (or symbols) transmitted per second. 
Clearly, 


Rs = a 
T 
and since each signal carries k = log, M bits of information, the bit rate is given by 
k 
R, = kR; = T 
The bit interval is 
T, = a. = G 
Rè k 


it is shown in Figure 8.34. 
The M signal waveforms may be one-dimensional or multidimensional. The one- 
dimensional M-ary signals are a generalization of the binary PAM (antipodal) signals. 


8.4.1 The Optimum Receiver for M-ary Signals in AWGN 


In Section 8.3, we described the optimum receiver for binary antipodal and binary orthog: 
onal signals in AWGN. In this section, we derive the optimum receiver for M-ary signal: 
corrupted by AWGN. 

As described above, the input sequence to the modulator is subdivided into k-bi 
blocks, called symbols, and each of the M = 2* symbols is associated with a correspondin, 
signal waveform from the set {Sm(t), m = 1, 2, ., M}. Each signal waveform is transmittec 





T». = bit interval 
T = symbol interval 


Figure 8.34 Relationship between the symbol interval and the bit interval. 
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withinthe symbol (signaling) interval or time slot of duration T. To be specific, we consider 
the transmission of information over the interval 0 < t < T. 

The channel is assumed to corrupt the signal by the addition of white Gaussian noise, 
as shown in Figure 8.18. Thus, the received signal in the interval 0 < t < T may be 
expressed as 


r(t) =Smlt)+n(t), O<t <T, (8.4.1) 


where n(t) denotes the sample function of the AWGN process with the power spectral 
density S (f) = Mo W/Hz. Based on the observation of r(t) over the signal interval, we 
wish to design a receiver that is optimum in the sense that it minimizes the probability of 
making an error. 


The Signal Demodulator. As in the case of binary signals, it is convenient to 
subdivide the receiver into two parts: the signal demodulator and the detector. The function 
of the signal demodulator is to convert the received waveform r (t) into an N-dimensional 
vector y = (y1, y2,---; YN), where N is the dimension of the transmitted signal wave- 
forms. The function of the detector is to decide which of the M possible signal waveforms 
was transmitted based on observation of the vector y. 

We have shown that the M-ary signal waveforms, each of which is N-dimensional, 
may be represented in general as i 


N 
Sn(t) = X smkVe(t), O<t<T, 


(8.4.2) 
k=1 S 
m=1,2...,M, 
where {Smk} are the coordinates of the signal vector 
Sm = (Sm1, Sm2, ws SmN), m=1,2,....,M, (8.4.3) 
and y(t) and k = 1,2,...,N are N orthonormal basis waveforms that span the 


N-dimensional signal space. Consequently, every one of the M possible transmitted sig- 
nals of the set {sm(t), 1 < m < M} can be represented as a weighted linear combination 
of the N basis waveforms {¥;,(t)}. 

In the case of the noise waveform n(t) in the received signal r(t), the functions 
{wx (t)} do not span the noise space. However, we will show that the noise terms that fall 
outside the signal space are irrelevant to the detection of the signal. 

As a generalization of the demodulator for binary signals, we employ either a 
correlation-type demodulator that consists of N cross correlators in parallel, as shown in 
Figure 8.35, or a matched-filter-type demodulator that consists of N matched filters in par- 
allel, as shown in Figure 8.36. To be specific, suppose the received signal r(t) is passed 
through a parallel bank of N cross correlators. The correlator outputs at the end of the. 
signal interval are 


T T 
f r(t)y (edt =f [sm (6) + nJ) dt, 
9 0 (8.4.4) 


Yk = Smk + Mk, k=1,2,...,N, 
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Figure 8.35 Correlation-type demodulator. 


Received 
signal 


r(t 
@ To detector 





Figure 8.36 Matched-filter-type demodulator. 
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where 


T 
s= f Sm(tWelt)dt, k=1,2,..., N, 
0 
(8.4.5) 


T 
nk = f npr (t)dt, k=1,2,...,N. 
0 
Equation (8.4.4) is equivalent to 


Y=Smtn, (8.4.6) 


where the signal is now represented by the vector Sm with components Smk and k = 
1,2,..., N. Their values depend on which of the M signals was transmitted. The com- 
ponents of n, i.e., {nx}, are random variables that arise from the presence of the additive 
noise. 

In fact, we can express the received signal r (t) in the interval 0 < t < T as 


N N 
r(t) = Do Smet) + Y nyl) + n'(0) 
k=1 


= k=l 
(8.4.7) 
N 
= J y(t) +n). 
k=l 
The term n’(t), defined as 
N $ 
n'(t) =n(t)— Y mlt), (8.4.8) 
k=1 


is a zero-mean Gaussian noise process that represents the difference between the original 
noise process n(t) and the part that corresponds to the projection of n(t) onto the basis 
functions {y;,(t)}. We will show that n’(t) is irrelevant when we decide which signal was 
transmitted. Consequently, the decision may be based entirely on the correlator output 
signal and noise components yk = Smk + nk, k =1,2,...,N. 

Since the signals {Sm(t)} are deterministic, the signal components {Smk} are deter- 
ministic. The noise components {n,} are Gaussian. Their mean values are 


T 
E(n) = Í E(n(t)) W(t) dt =0 (8.4.9) 
0 : 
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for all k. Their covariances are 


T T 
E(nknm) =f Í E(n(t)n(t)) We) Vn (t) dt dt 


T T No 
A z] i S(t — t)Wx(t)Wm (T) dt dt 
Zm o Jo 2 
í (8.4.10) 
N T 
=> | Wela lt) dt 
0 
Ni 
= bts 


where 6% = 1 when m = k and will otherwise be zero. Therefore, the N noise components 
{n,} are zero-mean uncorrelated Gaussian random variables with a common variance o? = 


No/2, and it follows that 


dh My E Ja (8.4.11) 


From the previous development, the correlator outputs {yg} conditioned on the mtt 
signal being transmitted are Gaussian random variables with mean 


E (yx) = E (Smk + Nk) = Smk (8.4.12 





and equal variance 


o? = o? = No/2. (8.4.13) 


Since the noise components {ną} are uncorrelated Gaussian random variables, they are alsc 
statistically independent. As a consequence, the correlator outputs {yg} conditioned on the 
mth transmitted signal are statistically independent Gaussian variables. Hence, the condi: 
tional probability density functions (PDFs) of the random variables (y1, y2,..., yw) =) 
are simply 


N 
fOlSm) = | | fOrlsm), m=1,2,...,M, (8.4.14 
k=1 


where 


eOk- INO ke = 1,2, ..., N. 4.1 


an 





1 
FS OrlSmk) = Tae 
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Substituting Equation (8.4.15) into Equation (8.4.14), we obtain the joint conditional PDFs 


as 
1 N 
ESEE oe = pa 2 
f(y|8m) = (nN)? exp l 20k Smk) m (8.4.16) 
= [- | y— s |? /No], m=1,2,...,M. (8.4.17) 
N “i 
As a final point, we wish to show that the correlator outputs (y1, y2,..., Yy) are 


sufficient statistics for reaching a decision on which of the M signals was wansmitted. 
In other words, we wish to show that no additional relevant information can be extracted 
from the remaining noise process n’(t). Indeed, n’(t) is uncorrelated with the N correlator 
outputs {yx}, i.e., l ` 

E (n' (Hye) = En (t))smk + E(n'(t)nx) 


= E(n'(t)n,) 


N 
=E n(t) -Y ny) nk 


j=l 


T N 
i E(n(t)n(t)) Ye (t) dt — Y Ejn) 


j=l 


= MO h(t) = NO y(t) = 0. (8.4.18) 
2 2 e 
This means n'(t) and rx are uncorrelated because n’(t) is zero mean. Since n'(t) and {y4} 
are Gaussian and uncorrelated, they are also statistically independent. Consequently, n’(t) 
does not contain any information that is relevant to the decision as to which signal wave- 
form was transmitted. All the relevant information is contained in the correlator outputs 
{yk}. Hence, n'(t) may be ignored. 





Example 8.4.1 
Consider an M = 4 PAM for a baseband AWGN channel in which the signal waveforms are 


SES ATOR RET eke lee eee ee NN ee ASENA TS 


Sm(t) = SmW(t), O<t<T, 





where W(t) is a unit energy rectangular pulse of duration T and the four signal points {Sm} 

are +d and +3d. Thus, the signal points are symmetric with respect to the origin and equally 

spaced with distance 2d between adjacent signal points. The additive noise is a zero-mean 

white Gaussian noise process with the spectral density No/2. Determine the PDF of the 

received signal at the output of the demodulator and sketch the PDFs for the case M = 4. | i ) 
: XM? 


Solution The received signal is expressed as 40 


r(t) = SmY (t) + n(t) Wx 
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Figure 8.37 PDFs for M = 4 received PAM signals in additive white Gaussian noise. 
and the output of the demodulator is 
T T 
WD =f rowod= f bO HOWO dr 
0 0 


= Sm +n, 


where n is a zero-mean Gaussian random variable with variance o? = No/2. Therefore, tt 
PDF of y = y(T) is 


1 
4/ TNo 





e7 075m No m=1,2,...,M 


f(y|Sm) = 
and Sm = (2m —1—M)d. 
The PDFs for M = 4 PAM are shown in Figure 8.37. 


Example 8.4.2 
Consider a set of M = 4 orthogonal signal waveforms in which the four signal points are 


sı = (V.,0,0,0), 
s2 = (v0, &, 0,0), 
s3 = (0,0, v%.,0), 
s4 = (0,0,0, v%,). 


The additive noise is a zero-mean white Gaussian noise process with a spectral density No/ 
Determine the PDF of the received signal vector y at the output of the demodulator, assumi 
that the signal s,(t) was transmitted and sketch the PDFs of each of the components of tl 
vector y. 


Solution The signal vector corresponding to the transmitted s;(t) is 
sı = (v%;, 0, 0, 0) 


and the received signal vector is, according to Equation (8.4.6), 


y =Sıi +n 


= (vE + ny, n2, n3, na), 
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fOilsy) 





fOis) i = 2,3,4 


Figure 8.38 The PDFs of the 
Yi received signal components of the 
vector y in Example 8.4.2. 


where the noise components 7), n2, n3, n4 are mutually statistically independent, zero-mean 
Gaussian random variables with identical variance o? = No/2. Therefore, the joint PDF of 
the vector components yj, y2, y3, Y4 is, according to Equation (8.4.16) with N = 4, 


2 
1 [a-v E) opni m 
e : 
No) 


The PDFs of each of the components of the vector y areshown in Figure 8.38. We note 
that the PDFs of y2, y3, y4 are identical. C 





fOlsi) = FOr, ya, Y3, yalsı) = 
(x 


The Optimum Detector. As we have observed from this development, when a 
: signal is transmitted over an AWGN channel, either a correlation-type demodulator or a 
matched-filter-type demodulator, produces the vector y = (y1, y2,---, YN), which con- 
tains all the relevant information in the received signal waveform. The received vector y 
is the sum of two vectors. The first vector is Sm, the signal vector that is equivalent to the 
transmitted signal, and the second vector is n, the noise vector. The vector Sm is a point 
in the signal constellation, and the vector n is an N-dimensional random vector with sta- 
tistically independent and identically distributed components, where each component is a 
Gaussian random variable with mean 0 and variance No/2. Since the components of the 
noise are independent and have the same mean and variance, the distribution of the noise 
vector n in the N-dimensional space has spherical symmetry. When Sm is wansmitted, the 
received vector y, which represents the transmitted signal Sm plus the spherically symmet- 
ric noise n, can be represented by a spherical cloud centered at sm. The density of this 
cloud is higher at the center and becomes less as we depart from sm, i.e., these points 
become less likely to be received. The variance of the noise No/2 determines the density 
of the noise cloud around the center signal Sm. For low No/2, the cloud is closely centered 
around Sm and its density (representing the probability) reduces sharply as the distance 
from the center is increased. For high No/2, the cloud is spread and larger distances have 
a higher probability compared with the low No/2 case. The signal constellation, the noise 
cloud, and the received vector are shown in Figure 8.39 for the case of N = 3 and M = 4. 

We wish to design a signal detector that makes a decision on the wansmitted signal 
in each signal interval based on the observation of the vector y in each interval, such that 
the probability of a correct decision is maximized. With this goal in mind, we consider a 
decision rule based on the computation of the posterior probabilities defined as 


P (signal Sm was transmitted |y) m=1,2,...,M, (8.4.19) 
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Figure 8.39 Signal constellation, nois 
cloud, and received vector for N = 3 
and M = 4. It is assumed that s; is 
transmitted. 


which we abbreviate as P(Sm|y). The decision criterion is based on selecting the signa 
corresponding to the maximum of the set of posterior probabilities {P (Sm|y)}. At the enc 
of this section, we show that this criterion maximizes the probability of a correct decision 
hence, it minimizes the probability of error. Intuitively, this decision is the best possibli 
decision that minimizes the error probability. It is clear that in the absence of any receive: 
information y, the best decision is to choose the signal Sm that has the highest prior proba 
bility P(s,,). After receiving the information y, the prior probabilities P(s,,) are replace: 
with the posterior (conditional) probabilities P(s,,|y), and the receiver chooses the Sm tha 
maximizes P(s,,|y). This decision criterion is called the maximum a posteriori probabilit 
(MAP) criterion. 
Using Bayes’s rule, we may express the posterior probabilities as 


P(sm)y) = LOP Gn) v a 


f(y) 
where f(y|Sm) is the conditional PDF of the observed vector given Sm, and P (Sm) is the 
priori probability of the mth signal being transmitted. The denominator of Equation (8.4.2 
may be expressed as 


M 
FO) =O FO/8m)P Sm): (8.4.21 


m=1 


From Equations (8.4.20) and (8.4.21), we observe that the computation of the poste 
rior probabilities P(s,,|y) requires knowledge of the a priori probabilities P (sm) and th 
conditional PDFs f(y|Sm) form = 1, 2,..., M. 

Some simplification occurs in the MAP criterion when the M signals equal] 
probable a priori, i.e., P(Sm) = 1/M for all M. Furthermore, we note that the denominat 
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in Equation (8.4.20) is independent of which signal is transmitted. Consequently, the deci- 
sion rule based on finding the signal that maximizes P (Sm|Y) is equivalent to finding the 
signal that maximizes f(y|sm). 

The conditional PDF f(y|s,,), or any monotonic function of it, is usually called the 
likelihood function. The decision criterion based on the maximum of f (y|Sm) over the M 
signals is called the maximum-likelihood (ML) criterion. We observe that a detector based 
on the MAP criterion and one that is based on the ML criterion make the same decisions, 
as long as the a priori probabilities P (Sm) are all equal; in other words, the signals {Sm} are 
equiprobable. 

In the case of an AWGN channel, the likelihood function f (y|Sm) is given by Equa- 
tion (8.4.16). To simplify the computations, we may work with the natural logarithm of 
f (ylsm), which is a monotonic function. Thus, 


-N A 
In f(yl8m) = = CENO) = FD Ok Sm). (8.4.22) 
k=1 


The maximum of ln f (y|Sm) Over Sm is equivalent to finding the signals Sm that minimize 
the Euclidean distance 


N 
DY, Sm) = X Ok — Smk). (8.4.23) 
k=1 ‘ 


We call D(y, Sm) m = 1,2, ..., M, the distance metrics. Hence, for the AWGN channel, 
the decision rule based on the ML criterion reduces to finding the signal Sm that is closest 
in distance to the received signal vector y. We will refer to this decision rule as minimum 
distance detection. 

Another interpretation of the optimum decision rule based on the ML criterion is 
obtained by expanding the distance metrics in Equation (8.4.23) as 


N N N 
D(Y, Sm) = > y2 2) YnSmn +) Sian 

n=1 n=1 n=1 (8.4.24) 
=|| y |? —2y-smt ll Sm l2,  m=1,2,..., M. 


The term | y |]? is common to all decision metrics; hence, it may be ignored in the com- 
putations of the metrics. The result is a set of modified distance metrics 


D'(y, Sm) = —2y-Sm+ |l Sm I? - (8.4.25) 


Note that selecting the signal Sm that minimizes D’(y, Sm) is equivalent to selecting the 
signal that maximizes the metric C(y, Sm) = —D' (Y, Sm), i.e., m PAAA 
wy 


CY, Sm) = 2Y-Sm— I Sm If (8.4.26) 


The term y-s,, represents the projection of the received signal vector onto each of the M 
possible transmitted signal vectors. The value of each of these projections is a measure of 
the correlation between the received vector and the mth signal. For this reason, we call 




















































































































394 Digital Modulation in AWGN Channel Chapter ¢ 


C(Y, Sm), m = 1,2,..., M, the correlation metrics for deciding which of the M signal; 
was transmitted. Finally, the terms || Sm "= Emm = 1,2, ... M, may be viewed as bia; 
terms that serve as compensation for signal sets that have unequal energies, such as PAM 
If all signals have the same energy, || Sm ||? may also be ignored in the computation of the 
correlation metrics C(y, Sm) and the distance metrics D’(y, Sm). 

In summary, we have demonstrated that the optimum ML detector computes a se 
of M distances D(y, Sm) or D’(y, Sm) and selects the signal corresponding to the smalles 
(distance) metric. Equivalently, the optimum ML detector computes a set of M correlatio1 
metrics C(y, Sm) and selects the signal corresponding to the largest correlation metric. 

This development for the optimum detector treated the important case in which al 
signals are equally probable. In this case, the MAP criterion is equivalent to the ML crite 
rion. However, when the signals are not equally probable, the optimum MAP detector base: 
its decision on the probabilities P(s,,|y), m = 1, 2,..., M given by Equation (8.4.20), o 
equivalently, on the posterior probability metrics, 


PMO, Sm) = f (18m) P (Sm). (8.4.27 
The following example illustrates this computation for binary PAM signals. 


Example 8.4.3 
Consider the case of binary PAM signals in which the two possible signal points are sı = 
=s = VEn: where ©, is the energy per bit. The prior probabilities are P (sı) and P (s2) 
Determine the metrics for the optimum MAP detector when the transmitted signal is corrupte: 
with AWGN. 


Solution The received signal vector (which is one dimensional) for binary PAM is 


y=stv€, +n, 


where n = y„(T) is a zero-mean Gaussian random variable with a variance o = No/z 
Consequently, the conditional PDFs f(y|s,,) for the two signals are 


fos) = e > vB) [en 





1 
210? 


2 
and a 1 << (r +6) N02 
= ——— e . 
am 2102 
Then the metrics PM(y, sı) and PM(y, s2) defined by Equation (8.4.27) are 
PM(y, sı) = FQ ls) Psi) 
and 
PM(y, s2) = f Ols2)P (s2). 


If PM(y, sı) > PM(y, s2), we select s, as the transmitted signal; otherwise, we 6 


decision rule may be expressed as i 
D a 


PMO, 52) 5 
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But 





ao _ PO) l v%)- (o -v€ | 202}, 


PM(y, s2) P (s2) 
so Equation (8.4.28) may be expressed as 


CVE) =O - VB) ap pun 
202 


MPG 





or equivalently, 







ya No a3 P(s2) 
re Ka P(s) (si) 


This is the final form for tector. Its input y from the demodulator is 
compared with the threshold (No a E) In (P (s2)/ P (s1)). We note that this is exactly the 
same detection rule that was obtained in Section 8.3.3, Equation (8.3.56), for binary antipodal 
signals (binary PAM). 

It is interesting to note that in the case of unequal prior probabilities, it is necessary to 
know not only the values of the prior probabilities but also the value of the power spectral 
density No and the signal energy %, in order to compute the threshold. When the two signals 
are equally probable, the threshold is zero. Consequently, the detector does not need to know 
the values of these parameters. a 


(8.4.29) 


We conclude this section with the proof that the decision rule based on the ML cri- 
terion minimizes the probability of error when the M signals are equally probable a priori. 
Let us denote by Rm the region in the N -dimensional space for which we decide that signal 
Sm(t) was transmitted when the vector y = (y1, y2, .. . Yy) is received. The probability of 
a decision error given that Sm (t) was transmitted is 


P(elsm) = f FOlSm)dy, (8.4.30) 
Rn 
where Rf, is the complement of Rm. The average probability of error is 
M 
Pu = D> = P(elsm) 
m=1 
xıl 
=J — | fOlsm)dy (8.4.31) 
m=) M Rin 


i 
i 


f Olsmad | . 


ae 
=a! 


We note that Py is minimized by selecting the signal Sm if f(yls) is larger than f(ylsx) 
for all m Ak. 


Rm 
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Similarly for the MAP criterion, when the M signals are not equally probable, the 
average probability of error is 


M 
Py=1=); f P(SnlY)fO)dY. (8.4.32) 
Rm 


Py is a minimum when the points that are to be included in each particular region Rm are 
those for which P (Sm ly) exceeds all other posterior probabilities. 


8.4.2 A Union Bound on the Probability of Error 


In the preceding sections, we have introduced a number of binary communication systems 
and their corresponding optimal detection schemes. We have also derived expressions for 
the error probability of these systems in terms of their SNR/bit (€,/No). Most of the error 
probability expressions are derived for equiprobable messages, where the optimal decision 
rule is the maximum-likelihood rule. 

We have seen that for binary equiprobable signaling over an AWGN channel, regard- 
less of the signaling scheme, the error probability can be expressed as : 


d 
Ph=Q (se) (8.4.33) 


where d is the (Euclidean) distance between the two signal points in the constellation and 
is related to the signal waveforms via 


> J GO — so(t))? dt. 


A natural question here is whether a similar simple expression exists for the error 
probability of the general equiprobable M-ary signaling. As discussed above, Equation 
(8.4.31) gives an expression for the error probability of a general equiprobable M-ary 
signaling scheme in AWGN. However, this equation cannot be reduced further into a closed 
form for error probability because, in general, the decision regions Rm are irregular and the 
integral of the Gaussian function over them does not have a closed form. However, there 
is a simple upper bound to the error probability of general equiprobable M-ary signaling 
systems. This upper bound is known as the union bound; we will derive this upper bound 
next. 

Let us assume that s,,(t) is transmitted in an M-ary equiprobable signaling scheme. 
The error probability is the probability that the receiver detects a signal other.than Sm (t). 
Let E; denote the event that message i is detected at the receiver. Then 


M é 
Sm(t) sent | <)> P (E;| Sm(t) sent). 


i=l i=l 
im im 


Pn = P(error| S(t) sent) = P U Ei 





(8.4.34) 
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A necessary (but not sufficient) condition for message s;(t) to be detected at the 
receiver when signal s,,(t) is sent is that y be closer to s; than to Sm, i.e., 


= 7 D(y, si) < D(Y, Sm). 
Thus, we conclude that 


P (Eil Sm(t) sent) < P (D(y, Si) < D(Y, Sm)). (8.4.35) 


But P (D(y, si) < D(y, Sm)) is the probability of error in a binary equiprobable signaling 


system, which, by Equation (8.4.33), is given by 








dmi 
P (D(y, Si D(Y, Sm)) = ; 8.4.36 
(D(y en )) o(-=) ( ) 
From Equations (8.4.35) and (8.4.36), we conclude that 
P (Ei sent) < o ( dmi ) (8.4.37) 
il m = ` 4. 
J2No 


Substituting Equation (8.4.37) into Equation (8.4.34) yields 


pas o(a an =). (8.4.38) 
Firea 





We define the minimum distance of a signal set as the minimum of the distances 
between any two points in the corresponding signal constellation. In other words, 


dmin = min dmm . (8.4.39) 
1<m,m'<M 


m! Am 


Then, for any 1 < i,m < M, we have dmi > dmin, and since the Q-function is a decreasing 


function [see the discussion following Equation (5.1.7)], we have 


dmi dmin 
< 8.4.40 
o(-#-)<0(=). (8440) 
Using the inequality alternative of Equation (8.4.40) in Equation (8.4.38) yields 















4 Pas Ý 


7 
/ 





j=% -no (= (8.4.41) 


i ih 





It is clear tħat-this-bound is independent of m. Therefore, it is valid independently of 
which signal was wansmitted. Hence, we conclude that 








ie dmi M-—1 — 4h 
~1 yp, <m- po (2 )s A, 8.4.42 
M m > <( vo (= se (8.4.42) 
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where the last step uses the inequality Q(x) < 1/2e-*' from Equation (5.1.8). Equa. 
tion (8.4.42) is known as the union bound for the error probability. 

As can be seen from Equation (8.4.42), there are two versions of the union bound 
The first version is in terms of the Q-function, and the second version is in terms of thx 
exponential function. The union bound provides a very useful bound on the error proba 
bility, particularly at high signal-to-noise ratios, Atlower SNRs, the bound becomes loos 
and useless. To analyze the performance of a system at low SNRs, more powerful bounding 
techniques are needed. 

The union bound also signifies the role of the minimum distance of a signal set o1 
its performance, particularly at large SNRs. A good signal set should provide the maxi 
mum possible value of dmin. In other words, to design a good signal set, the points in thi 
corresponding constellation should be maximally apart. 


8.5 M-ARY PULSE AMPLITUDE MODULATION 


The generalization of binary PAM to M-ary PAM is relatively straightforward. The k-bi 
symbols are used to select M = 2* signal amplitudes. Hence, in general, the M-ary PAN 
signal waveforms for a baseband channel may be expressed as 


Sm(t) = Amp(t), m=1,2,...,M (8.5.1 
=smW(t), m=1,2,...,M (8.5.2 


where p(t) is a lowpass pulse signal of duration T and y(t) = p(t)/,/€, is the normalize 
version of it. In the special case where p(t) is a rectangular pulse shape we can write 


Sm(t) =Amgr(t), O<t<T, m=1,2,..., M 


= Sm (t), 0<t<T, m=1,2,...,M, (8.5.3 


where the pulse gy (t) and the basis function y(t) are shown in Figures 8.40(a) and 8.40(b 
respectively. We observe that all M signal waveforms have the same pulse shape. Henc 
they are one-dimensional signals. We also note that, in general, Sm = Am/€p which in th 
special case when 


1 O<t<T 
0 otherwise 


p(t) = gr(t) = | 


gr (t) y(t) 
1 
VIT 
0 T $ 0 T i 


Figure8.40 Rectangular pulse gy (t) as 
(a) (b) basis function y(t) for M-ary PAM. ` 
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becomes Sm = Am/T. An important feature of these PAM signals is that they have differ- 
ent energies. That is, 


T T 
Em = f s? (tdt = s2 f w(tdt = s? = A2E,. (8.5.4) 
0 0 < 


Assuming that all k-bit symbols are equally probable, the average energy of the wansmitted 
signals is 


ba es 
y= aN A 8.5. 
Ea no l M 2 (8.5.5) 


In order to minimize the average transmitted energy and to avoid transmitting signals 
with a DC component, we want to select the M signal amplitudes to be symmetric about 
the origin and equally spaced. That is, 





Am = (2m — 1 — M)A, m=1,2,...,M, (8.5.6) 
where A is an arbitrary scale factor. The corresponding average energy, assuming that all 
k-bit symbols are equally probable, is a A 

eas 
AS, X 
by = X Om-1-M? 
m=1 (8.5.7) 


= A?€,(M? — 1)/3. 


For the signal amplitudes given in Equation (8.5.6), the corresponding signal con- 
stellation points in a geometric representation of the M-ary PAM signals are given as 


Sm = Amy €p (8.5.8) 
= A/Z, Qm-1-M), m=1,2,..M. g 


It is convenient to define the distance parameter d as d = A,/® p, so that 
Sm = (2m — 1— M)d, m=1,2,...,M. (8.5.9) 


The signal constellation point diagram is shown in Figure 8.41. Note that the distance 
between two adjacent signal points is 2d. 


Example 8.5.1 


Sketch the signal waveforms for M = 4 PAM, which are described by Equation (8.5.3) and 
determine the average transmitted signal energy. 


2d 
-5d -3d d o - 3d Sd 


Figure 8.41 M = 4 PAM signal waveforms. 
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sı(t) s2(t) 
T T 
t t 
—A 
—3A 
83(t) sa(t) 
3A 
A 
t t Figure8.42 M =4PAMsignal 
T T _ waveforms. 


Solution The four signal waveforms are shown in Figure 8.42. 
The average energy, based on equally probable signals according to Equation (8.5.7), 


Cay = 5A°T = 5d’, 


where d? = AT by definition. 
8.5.1 Carrier-Modulated PAM for Bandpass Channels (M-ary ASK) 


To transmit the digital signal waveforms through a bandpass channel by amplitude modul 
tion, the baseband signal waveforms s,,(t),m = 1, 2,..., M are multiplied by a sinusoid 
carrier of the form cos 21 fst, as shown in Figure 8.43. In this figure, f, is the carrier fr 
quency and corresponds to the center frequency in the passband of the channel. Thus, tl 
transmitted signal waveforms may be expressed as 


Um(t) = S(t) cos 27 fet, m=1,2,...,M.. (8.5.1 
As previously described in Section 3.2, amplitude modulation of the carrier cos 27, 


by the baseband signal waveforms {Sm(t)} shifts the spectrum of the baseband signal | 


Baseband signal Bandpass signal 






Sm(t) cos 21f,t 


Carrier 


Figure 8.43 Amplitude modulation of the sinusoidal carrier by 
cos 2nf,t 


baseband signal. 
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ISa ADI 
1 
— ! 
-W 0 Ww f 

(a) 
[UnA] 

IL 
2 

Sfo=W° ie FetW 0 f.-W fe ftw f 


Figure 8.44 Spectra of (a) baseband and (b) amplitude-modulated signals. 


an amount f,; thus it places the signal into the passband of the channel. Recall that the 
Fourier transform of the carrier is HCS — fc) +6(f + fe)]. Because multiplication of two 
signals in the time domain corresponds to the convolution of their spectra in the frequency 
domain, the spectrum of the amplitude-modulated signal given by Equation (8.5.10) is 


1 Š 
Un (P) = 51Sm(F = fe) + Sm(f + fol. (8.5.11) 


Thus, the spectrum of the baseband signal s(t) is shifted in frequency by an amount 
equal to the carrier frequency f,. The result is a double-sideband suppressed-carrier (DSB- 
SC) amplitude-modulated signal, as illustrated in Figure 8.44. The upper sideband of the 
carrier-modulated signal contains the frequency content of um(t) for |f| > fe, i.e., for 
fe < |fl < fe + W. The lower sideband of um(t) includes the frequency content for 
IfK< fe, ie. for fe — W < |f| < fe. Hence, the DSB-SC amplitude-modulated signal 
occupiés.a channel bandwidth of 2W, which is twice the bandwidth required to transmit 
the baseband signal. 

The ekergy of the bandpass signal waveforms um(t),m = 1,2,..., M, which are 
given by Equation (8.5.10), is defined as 


foe) foe) 
Em =f u2 (t)dt = i s2(t) cos? 2n fet dt 
= 


=- sp (t)dt + = 5,,(t) cos 4r fet dt. (8.5.12) 
D hss 2 Jax 
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We note that when fe > W, the term 


2 


1 CO 
= | s2(t)cos4n fet dt (8.5.13 
CO 


involves the integration of the product of a slowly varying function, i.e., s2 (t), with < 
rapidly varying sinusoidal term, i.e., cos 4x ft. With an argument similar to the argumen 
following Equation (3.2.2), we conclude that the integral in Equation (8.5.13) over a single 
cycle of cos 47 fet is zero (see Figure 3.6); hence, the integral over an arbitrary number o: 
cycles is also zero. Consequently, the energy of the carrier-modulated signal is 


o0 


1 
Em = / s?(t) dt. (8.5.14 


Thus, we have shown that the energy in the bandpass signal is one-half of the energy in th 
baseband signal. The scale factor of 1/2 is due to the carrier component cos 2x f,t, whicl 
has an average power of 1/2. 

In M-ary amplitude-shift keying (ASK), the signal waveforms have the general forn 


Sm(t) = Amp(t) cos2nf,t, 1<m<M (8.5.15 


where p(t) isa baseband signal of duration T . It is seen that all signals are multiples of th 
same signal p(t) cos 21 fst and only the amplitude Am differentiates them. Therefore, al 
signals can be expressed in terms of a single normalized signal 





2 
= p(t)cos2n fet (8.5.16 
Ep 





This shows that the dimensionality o 


Example 8.5.2 
A carrier-modulated PAM signal may be expressed as 


e signals space for ASK signals is N = 1. 


Um(t) = Sm (t) cos 27 fot 
=Snv(t)cos2uf.t, 0<t <T, 
where Sm = (2m—1—M)d,m =1,2,...,M,andd = A\/G,, as given by Equation (8.5.9 
and y(t) is a unit energy pulse defined over the interval 0 < ¢ < T. Determine the bas: 


function for the carrier-modulated PAM signal and the corresponding signal points in a vect 
representation. 


Solution The carrier amplitude PAM signal may be expressed as 


un(t) = Tovet) = Senet) » m=1,2,..., M, 


where 


W(t) = V2W(t) cos2n fot 


and 
Sem = Sm/ V2 = (2m — 1 — M)d/V2, m=1,2,...,M. 
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8.5.2 Demodulation and Detection of Amplitude-Modulated PAM Signals 


The demodulation of a bandpass signal may be accomplished by means of correlation 
or matched filtering, as described for baseband channels. However, as we will observe 
from the development that follows in Section 8.8, the presence of the carrier introduces an 
additional complication in the demodulation of the signal. 

The transmitted signal in a signaling interval may be expressed as 


Um(t) =Sm(t)cos2n fct, m=1,2,...,M, (8.5.17) 
and the received signal has the form 
r(t) = um (£) + n(t), (8.5.18) 


where 
n(t) = ne(t) cos 27 fet — ns(t) sin 27 fet (8.5.19) 


is a bandpass noise process, as discussed in Section 5.3.3. 
By cross correlating the received signal r(t) with the basis function y(t) = 
2y(t) cos 27 fet, we obtain, at the sampling instant t = T, 


T 
y(T) = [ r(t)Welt) dt 


T T 
= ism | woos? an fat dt + V3 | nY (t) cos 2 fet dt 
0 $ 0 


eae eae a (8.5.20) 


v2 


where n represents the additive noise component at the output of the correlator. An identical 
result is obtained if a matched filter replaces the correlator to demodulate the received 
signal. 

As in the case of baseband PAM, for equiprobable messages, the optimum detector 
bases its decision on the distance metrics 


DO, Sm) = Y — Sem), m=1,2,..., M (8.5.21) 
or, equivalently, on the correlation metrics 
C (Y, Sm) = 2YSem — Shn- (8.5.22) 


8.5.3 Probability of Error for M -ary PAM 


In the case of M -ary PAM in baseband AWGN channels, the input to the detector is 
y=Sm +n, (8.5.23) 


where Sm denotes the mth wansmitted amplitude level, defined previously in Equation 
(8.5.9), and n is a Gaussian random variable with zero mean and variance of = No/2. 
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Si Si—1 Si-2 Si-3 Si-g `. Si-5 
tO —— —_—__—____———_e—___+—__—_e——_+__.. 
Ti Ti-1 Ti-2 Ti-3 Ti-4 

2d 


si- Signal point 
7;- thresholds 


Figure 8.45 Placement of thresholds at midpoints of successive amplitude levels. 


The average probability of error for equally probable amplitude levels can be determinec 
from the decision rule that finds the maximum of the correlation metrics: 


C(y, Sm) = 2ySm — S 
= 2(y — Sm/2)Sm. 


Equivalently, the optimum detector may compare the input y with a set of M — 1 thresholds 
which are placed at the midpoints of successive amplitude levels, as shown in Figure 8.45 
Thus, a decision is made in favor of the amplitude level that is closest to y. 

The placing of the thresholds as shown in Figure 8.45 helps evaluate the probabil 
ity of error. On the basis that all amplitude levels are equally likely a priori, the averag: 
probability of a symbol error is simply the probability that the noise variable n exceeds i1 
magnitude one-half of the distance between levels. However, when either one of the tw 
outside levels +(M — 1) is transmitted, an error can occur in one direction only. Thus 
we have 


(8.5.24 


Bp BIO ply cre ate d 
M >= M Y — Sm! > tH (Y — Sm > d) 


'M-1 
= ay PO sel > d) 





M-1 2 oe #2 
= =x" /No 
M Sanh o (8.5.25 
M-1 2 ce 


= —— en? dx 


M ve a/ 2d2/No 
= 0 (VaR), 


where 2d is the distance between adjacent signal points. 

The distance parameter d is easily related to the average transmitted signal energ: 
Recall that d = A/p and w is given by Equation (8.5.7) as Gay = AE p(M? — 1)/: 
Hence, 


go 


Say = Ê (M? — 1)/3, (8.5.2€ 


and the average probability of error is expressed as 


2(M — 1) bBo ; 
py = Yo ae om) | (8.5.2 





| 
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Since the average transmitted signal energy av = T Pay, where Pay is the average trans- 
mitted power, Py may also be expressed as a function of Pay. 

In plotting the probability of a symbol error for M-ary signals such as M-ary PAM, 
it is customary to use the average SNR/bit as the basic parameter. Since each signal carries 
k = log, M bits of information, the average energy per bit Spay is given by Spav = av/k 
and k =log,M. Equation (8.5.27) may be expressed as 


_ 2M -1) 6(log) M) Etay I? 

Py = 7; Q (| (M2 — 1) Np ; | (8.5.28) 
e ee One 

where pay/No is the average SNR/bit. Figure 8.46 illustrates the probability of a symbol 

error as a function of 10logio@pay/No with M as a parameter. Note that the case M = 2 
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Figure 8.46 Probability of a symbol error for PAM. 
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corresponds to the error probability for binary antipodal signals. We also observe that for 
a fixed error probability, say Py = 1075, the SNR/bit increases by over 4 dB for every 
factor of two increase in M. For large M, the additional SNR/bit required to increase M 
by a factor of two approaches 6 dB. Thus, each time we double the number of amplitude 
levels M, we can transmit one additional information bit; the cost measured in terms of 
additional transmitter power is 6 dB/bit for large M, i.e., the transmitter power must be 
increased by a factor of 4 (10log,) 4 = 6 dB) for each additional transmitted bit in order 
to achieve a specified value of Py. 


Example 8.5.3 
Using Figure 8.46, determine (approximately) the SNR/bit required to achieve a symbol erro; 
probability of Py = 1076 for M = 2, 4, and 8. 


Solution From observation of Figure 8.46, we know that the required SNR/bit is (approxi. 
mately) as follows: 


10.5dB for M =2 (1 bit/symbol); 
14.8dB for M = 4 (2 bits/symbol); 
19.2dB for M =8 (3 bits/symbol). 


Note that for these small values of M, each additional bit by which we increase (double) th 
number of amplitudes requires an increase in the transmitted signal power (or energy) by : 
little over 4 dB in order to maintain the same symbol error probability. For large values of M 
the argument of the function Q(x) in Equation (8.5.28) is the dominant term in the expressio1 
for the error probability. Since M = 2*, where k is the number of bits/symbol, increasing k b: 
1 bit to k + 1 requires that the energy/bit must be increased by a factor of 4 (6 dB) in order tı 
have the same value of the argument in Q(x). 1 


8.6 PHASE-SHIFT KEYING 


In the preceding section, we observed that bandpass signal waveforms, which are appro 
priate for wansmission on bandpass channels, were generated by taking a set of baseban 
signals and impressing them on the amplitude of a carrier. In this section, we generat 
bandpass signal waveforms by digitally modulating the phase of the carrier. 

In ASK, the general form of the wansmitted waveforms is Am p(t) cos 21 fet, i.e., th 
dependence of the signals to message m is through the amplitude Am. In phase-shift keyin 
(PSK), this dependence is through the phase of the signal m. In general, PSK waveform 


are of the following form: 
x 


“Um (t) = p(t) cos (2m fet + m), 


where p(t) is a baseband signal of duration T and $m is determined by the transmitte 
message. Since the signal phase m depends on the message m, thus the signal is phas 
modulated. Because the transmitted signals are different only in their phase, all PSK signa 
have the same energy and thus in the signal space representation they are all at the sarr 
distance from the origin. Therefore, the signal space representation of PSK signals is a sı 
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of M points on a circle of radius v€, and centered at the origin, where @, is the common 
energy of the PSK waveforms. It is convenient to select ¢,, values equally spaced as 


om = —, m=0,1,...,M-1 (8.6.1) 
In this case, the PSK waveforms have the general form of 
2 
Um(t) = p(t) cos (rfa + mm) , m=0,1,...,M —1. (8.6.2) 


For the special case where p(t) is a rectangular baseband pulse, we have 


Es 
p(t) = gr(t) = [Vr 0<t<T 


0 otherwise 
and 
J 28 cos (2n fet + Z2), O<1<T 
Um(t) = T 2 M?? = m=0,1,...,M—1, (8.6.3) 
0 otherwise 


where ©, is the energy in each signal and T is the signal duration. Since each signal carries 
k = log, M bits of information, we have €, = @;/log, M. Figure 8.47 illustrates a four- 
phase (M = 4) PSK signal waveform, usually called a quadrature PSK (QPSK) signal. 


By expanding the cosine term in Equation (8.6.3), we may express um (t) as 
Um(t) = gr(t)Amc cos 27 fet — gr(t)Ams sin 27 fet, (8.6.4) 
where 
Amc = cos2am/M, m=0,1,...,M—1, 
Ams = sin2um/M, m=0,1,...,M—1. 


(8.6.5) 


180° phase 0° phase —90° phase 
shift shift shift 


> ae: 


Figure 8.47 Example of a four-phase 
2T 3T 4T PSK signal. 
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cos 2nf,t 


Pulse- 
shaping 
filter 


(D Phase-modulated 
ET signal 


Pulse- 
shaping 
filter 





—sin 2nf,t 


Figure 8.48 Block diagram of a digital-phase modulator. 


Therefore, the modulator for the phase-modulated signal may be implemented a: 
shown in Figure 8.48, employing two quadrature carrier signals, where each quadrature 
carrier is amplitude modulated by the information-bearing signal. The pulse-shaping fil 
ters are designed to limit the spectrum of the transmitted signal to the allocated channe 
bandwidth. 


8.6.1 Geometric Representation of PSK Signals 


It follows from Equation (8.6.4) that digital phase-modulated signals can be represente 
geometrically as two-dimensional vectors with components /@,cos2mm/M an 
Vé; sin2nm/M, i.e., 


Sm = (Vē; cos 2um/M, V%,sin2xm/M) : (8.6.6 


Note that the orthonormal basis functions are 


t - 

pt) = Veer cos 2n fot 
l 1 f 

pt) =- gor) sin2n fot, 


where the energy of the pulse g(t) is normalized to 2@,. Signal-point constellations fo 
M =2?, 4, and 8 are illustrated in Figure 8.49. We observe that binary phase modulation i 
identical to binary ASK and belongs to the family of binary antipodal signaling. 

The mapping or assignment of k information bits into the M = 2% possible phase 
may be done in a number of ways. The preferred assignment is to use Gray encoding, i: 
which adjacent phases differ by one binary digit, as illustrated in Figure 8.49. Because th 
most likely errors caused by noise involve the erroneous selection of an adjacent phas 


and 
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M=2 


Figure 8.49 PSK signal constellations. 


_to the transmitted phase, only a single bit error occurs in the k-bit sequence with Gray 
encoding. 
The Euclidean distance between any two signal points in the constellation is 


dmn =N || 8m — Sn II 


= fe (1 — cos me”), 


and the minimum Euclidean distance (the distance between two adjacent signal points) is 
simply 


(8.6.7) 





20 TE 
dmin = Ps (: — cos =) = NEA sin TA (8.6.8) 


The minimum Euclidean distance dmin plays an important role in determining the 
error-rate performance of the receiver that demodulates and detects the information in the 
presence of additive Gaussian noise, as we have previously observed in Section 8.4.2. 


Example 8.6.1 


Consider the M = 2, 4, and 8 PSK signal constellations shown in Figure 8.49; all have the 
same transmitted signal energy Es. Determine the minimum distance dmin between adjacent 
signal points. For M = 8, determine by how many dB the transmitted signal energy 6, must 
be increased to achieve the same dmin as M = 4. 


Solution For M = 2, dnin = NER ; for M = 4, dmin = WA ; and for M = 8, 
dnin = NEA sin 1/8 
= V0.5868,. 
For M = 8, the energy 6, must be increased by the factor of 2/0.586 = 3.413 or 5.33 


dB, in order to achieve the same minimum distance as M = 4. 
For large values of M, sin n/M ~ m/M, so 


21 
dmin © M” s, M>2. 
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Consequently, when the number M of signal points is doubled, which allows us to transmi 
one additional information bit per symbol, the signal energy s must be increased by a fac 
tor of 4, or 6 dB, in order to maintain the same minimum distance between adjacent signa 
points. = l 


8.6.2 Demodulation and Detection of PSK Signals 


The received bandpass signal from an AWGN channel in a signaling interval 0 < t < 7 
may be expressed as 


r(t) =Uum(t) +n(t) 
= [Amcgr(t) + ne (t)]cos 2n fet — [Amsgr(t) + ns(t)] sin 27x fet (8.6.9 
m = 0,1,2,...,M - 1, 


where n(t) is the additive bandpass Gaussian noise represented in terms of its quadratur 
components nc(t) and n,;(t) as 
n(t) = ne(t) cos 2nfet — ns(t) sin 2x fet 


and Amc and Ams are the information-bearing signal components that are related to th 
transmitted carrier phase by Equation (8.6.5). 
The received signal may be correlated with 


1 

Wo = N g ET (t) cos 2x fet 
/1 l 

p(t) = — g sr) sin 2n fet. 


The outputs of the two correlators yield the two noise-corrupted signal component 
which may be expressed as 


and 


y =Smtn 


= (VE; cos2nm/M+ne, /€6s,sin2nm/M + ns) y eeu 
where, by definition, 
1 T 
Ne = zel 8r(t)n-(t) dt, 
s J0 
(8.6.1 
1 T 
n =e f ng(t)gr(t)dt. 
NV s JO 


Because the quadrature noise components ne(t) and ns(t) are zero mean and uncorr 
lated (see Property 4 of filtered noise in Section 5.3.3), it follows that E (nc) = E(ns) = 
and E (nens) = 0. 
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The variance of the noise components is 





1 T T 
E (ne) = E (ns) = zg i [ gr(t)gr(t)E(ne(t)ne(t)) dt dt 


Ne FS (8.6.12) 
= wS 87 (t) dt 


= No/2. 


The optimum detector projects the received signal vector onto each of the M pos- 
sible transmitted signal vectors {s,,} and selects the vector corresponding to the largest 
projection; thus, we compute the correlation metrics 


C(Y, Sm) =Y-Sm, m=0,1,...,M—1 (8.6.13) 


and select the signal vector that results in the largest correlation. 
Because all signals have equal energy, an equivalent detector metric for digital phase 
modulation is to compute the phase of the received signal vector y = (y1, y2), which is 


-1 X2 
y 


© = tan `~ (8.6.14) 
and select the signal from the set {Sm} whose phase is closest to ©. In the next section, we 
evaluate the probability of error based on the phase metric given by Equation (8.6.14). 


8.6.3 Probability of Error for Phase-Coherent PSK Modulation 


In this section, we shall evaluate the probability of error for M-ary phase modulation in 
AWGN with the optimum demodulator and detector. The optimum detector based on the 
phase metric given by Equation (8.6.14) will be used in the computation. We assume that 
a perfect estimate of the received carrier phase is available. Consequently, the performance 
that we derive is for ideal phase-coherent demodulation. 

Consider the case in which the transmitted signal phase is 9 = 0, corresponding to 
the signal uo(t). Hence, the transmitted signal vector is 


so = (VE 0) (8.6.15) 


and the received signal vector has the components 


y = V6s + Ne, 


8.6.16 
y2 = Ns. ( ) 


Because nc and ns are jointly Gaussian random variables, it follows that y; and y2 
are jointly Gaussian random variables with E(y1) = /@;, E(y2) = 0, and as =o2 = 


y2 
No/2 = o;. Consequently, z 


a 
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[vE] 





fy 1, y») = (8.6.17) 


2 
210; 


The detector metric is the phase © = tan~! y2/yı. The PDF of © is obtained by a change 


in variables from (y1, y2) to 
V = J} +33 


and (8.6.18) 


© = tan! 2, 
yı 


This change in variables yields the joint PDF 


fv.eQ, 0) = re aia Bs vc0s)/203 (8.6.19 


y 


Integration of f,,@(v, 0) over the range of v yields fo(8), i.e., 


o0 
fo(@) = [ fv,o(v, 0) dv (8.6.20 
1 a2 $ 2 
= Bae sin f. pe eV 2Ps cos 6) dy, (8.6.21 
0 


where, for convenience, we have defined the symbol SNR as ps = ©;/No. Figure 8.51 
illustrates f@(@) for several values of the SNR parameter ps when the transmitted phas: 
is zero. Note that f@(@) becomes narrower and more peaked about 0 = 0 as the SNR p 
increases. 

When uo(t) is transmitted, a decision error is made if the noise causes the phase tı 
fall outside the range [—1/M, n/M]. Hence, the probability of a symbol error is 


T/M 
Py =1- | fo (6) dd. (8.6.22 
—1/M 


In general, the integral of f@(@) does not reduce to a simple form and must be evaluate 
numerically, except for M = 2 and M = 4. 

An alternate form of the symbol error probability has been derived by Weistei 
(1974) and Pawula (1999), and may be expressed as 


1 n—n/M 5 Si 2 
Pye [ exp | samen d0. (8.6.22 
0 sin” 0 


For binary phase modulation, the two signals u(t) and w(t), are antipodal; henc: 


the error probability is 
2p 
P, = — |]. 8.6.24 
2=Q Xi No ( 
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—3.14—2.51 —1.88 —1.26—0.63 0.00 0.63 1.26 188 2.51 3.14 
8 function fo (9) for p, = 1, 2, 4, 10. 


Figure 8.50 Probability density 


When M = 4, we basically have two binary phase modulation signals in phase quadrature. 
With a perfect estimate of the carrier phase, there is no crosstalk or interference between 
the signals on the two quadrature carriers; hence, the bit error probability is identical to 
that in Equation (8.6.24). On the other hand, the symbol error probability for M = 4 is 
determined by noting that 


2 


P, =(1— Py)? =|1-@ [ae , (8.6.25) 
0 


where P, is the probability of a correct decision for the 2-bit symbol. The result in Equa- 
tion (8.6.25) follows from the statistical independence of the noise on the quadrature car- 
riers. Therefore, the symbol error probability for M = 4 is 


Pi =1-—P,, (8.6.26) 


26p 1 26, 
=2 — ->= — |]. 8.6.27 
P4=2Q | N, 1-352 Ni No ( ) 







































































































































































414 Digital Modulation in AWGN Channel Chapter , 


107! 


Py, Probability of error 
5 
wo bw 


-4 0 4 8 12 16 20 24 Figure 8.51 Probability of a symbol 
SNR/bit, dB error for PSK signals. 


Tf the signal-to-noise ratio is not too low, then 5 Q (Vē 7No) < 1, and we have 


2%, 


P4 X 2Q No 


(8.6.21 


For M > 4, the symbol error probability Pm is obtained by numerically integrating eith 
Equation (8.6.22) or Equation (8.6.23). Figure 8.51 illustrates this error probability as 
function of the SNR/bit for M = 2, 4, 8, 16, and 32. The graphs clearly illustrate tł 
penalty in the SNR/bit as M increases beyond M = 4. For example, at Py = 1075, tt 
difference between M = 4 and M = 8 is approximately 4 dB, and the difference betwee 
M = 8 and M = 16 is approximately 5 dB. For large values of M, doubling the numb 
of phases requires an additional.6 dB/bit to achieve the same performance. 

To approximate the error probability for large values of M and large SNR, we mu 
first approximate f@(@). For s/No > 1 and |O| < m/2, f,(8) is approximated as 


fod) = |Æ cosa en Pes? (8.6.2! 
T 
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By substituting for fo(@) in Equation (8.6.22) and performing the change in variable from 
6 to u = ./p; sin, we find that 


n/M P 3 
Py ®1 -f = cos 8 es im 9 go 
-n/M T 


2 f. —u?/2 
~ — e du 
V 20 J./2ps sinn/M 


=2Q ( 2ps Sin Z) 


M 

=20 (2ko sin =) (8.6.30) 

=20 | /2ksin? (=) h (8.6.31) 
0 


2x? log, M p 
x2 — = — ], 8.6.32 
Qf M No ( ) 


where k = log, M and ps = kp». Note that the last approximation holds for large M, 
where sin j, œ% ṣọ. We note that the approximations to the error probability given in 
Equations (8.6.30) and (8.6.31) are good for all values of M. For example, when M = 2 
and M = 4, we have P) = Py = 20 (v2), which compares favorably (a factor 
of 2 difference) with the exact error probability given by Equation (8.6.24). Also, from 
Equation (8.6.32), it is clear that due to the presence of M? in the denominator, doubling 
M deteriorates the performance by a factor of 4 (6 dB). This is similar to the performance 
of baseband and carrier-modulated PAM signals. 


xla 


Example 8.6.2 
By using the binary event error probability given in Equation (8.3.65) and the Euclidean dis- 
tance between two adjacent signal points in a PSK signal constellation, which is given by 
Equation (8.6.8), determine an approximation to the symbol error probability. Compare the 
result with that given by Equation (8.6.30). 


Solution First, we determine the error probability in selecting a particular signal point other 
than the transmitted signal point when the signal is corrupted by AWGN. From Equation 


(8.3.65), we have 
d2 
P, = 2 
=Q y No |’ 


where d?, is the square of the Euclidean distance between the transmitted signal point and 
the particular erroneously selected signal point. In the case of a PSK signal constellation, the 
error probability is dominated by the erroneous selection of either one of the two signal points 
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adjacent to the transmitted signal point. Consequently, an approximation to the symbol err 
probability is 
diin 


2No 





+ 


Py ~2Q 


where dmin is given by Equation (8.6.8). By substituting for dmin in the approximation to P, 
we obtain 


. 1 
Py ~20 ( 2p; sin =) 7 
which is identical to the expression given in Equation (8.6.30). 


The equivalent bit error probability for M-ary phase modulation is rather tedious 1 
derive due to its dependence on the mapping of k-bit symbols into the corresponding signi 
phases. When the Gray code is used in the mapping, two k-bit symbols corresponding | 
adjacent signal phases differ in only a single bit. Because the most probable errors due | 
noise result in the erroneous selection of an adjacent phase to the true phase, most k-b 
symbol errors contain only a single bit error. Hence, the equivalent bit-error probability fc 
M -ary phase modulation is well approximated as 

k 

The performance analysis just given applies to phase-coherent demodulation wit 
conventional (absolute) phase mapping of the information into signal phases. As indicate 
in Section 8.6.4, when phase ambiguities result in the estimation of the carrier phase, tt 
information symbols are differentially encoded at the transmitter and differentially decode 
at the receiver. Coherent demodulation of differentially encoded phase-modulated signa 
results in a higher probability of error than the error probability derived for absolute-pha: 
encoding. With differentially encoded signals, an error in the detected phase (due to nois 
will frequently result in decoding errors over two consecutive signaling intervals. This 
especially the case for error probabilities below 10~!. Therefore, the probability of err 
for differentially encoded M -ary phase-modulated signals is approximately twice the prol 
ability of error for M-ary phase modulation with absolute-phase encoding. However, 
factor-of-2 increase in the error probability translates into a relatively small loss in SN] 
as depicted in Figure 8.51. 


8.6.4 Differential Phase Encoding and Differential Phase Modulation 
and Demodulation 


The performance of ideal, coherent phase modulation and demodulation is closely attains 
in communication systems that transmit a carrier signal along with the information signi 
The carrier-signal component, usually referred to as a pilot signal, may be filtered fro 
the received signal and used to perform phase-coherent demodulation. However, when 1 
separate carrier signal is transmitted, the receiver must estimate the carrier phase from tl 
received signal. As indicated in Section 8.8.1, the phase at the output of a phase-lock: 
loop (PLL).has ambiguities of multiples of 2n/M, necessitating the need to differential 
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encode the data prior to modulation. This differential encoding allows us to decode the 
received data at the detector in the presence of the phase ambiguities. 

In differential encoding, the information is conveyed by phase shifts between any 
two successive signal intervals. For example, in binary-phase modulation, the information 
bit 1 may be transmitted by shifting the phase of the carrier by 180° relative to the previous 
carrier phase, while the information bit 0 is transmitted by a zero-phase shift relative to the 
phase in the preceding signaling interval. In four-phase modulation, the relative phase shifts 
between successive intervals are 0°, 90°, 180°, and 270°, corresponding to the information 
bits 00, 01, 11, and 10, respectively. The generalization of differential encoding for M > 4 
is straightforward. The phase-modulated signals resulting from this encoding process are 
called differentially encoded. The encoding is performed by a relatively simple logic circuit 
preceding the modulator. 

Demodulation and detection of the differentially encoded phase-modulated signal 
may be performed as described in Section 8.8.4 using the output of a PLL to perform the 
demodulation. The received signal phase ®© = tan! y2/y, at the detector is mapped into 
one of the M possible transmitted signal phases {0} that is closest to ©. Following the 
detector, there is a relatively simple phase comparator that compares the phases of the 
detected signal over two consecutive intervals to extract the transmitted information. Thus, 
phase ambiguities of 2x/M are rendered irrelevant. 


Differential Phase-Shift Keying. A differentially encoded phase-modulated sig- 
nal also allows a type of demodulation that does not require the estimation of the carrier 
phase. Instead, the phase of the received signal in any given signaling interval is compared 
to the phase of the received signal from the preceding signaling interval. To elaborate, sup- 
pose that we demodulate the differentially encoded signal by multiplying r (t) by cos 21 fot 
and sin 2x f-t and integrating the two products over the interval T. At the kth signaling 
interval, the demodulator output is given by the complex number 


Ye = VEs I + ny, (8.6.34) 


where 6; is the phase angle of the transmitted signal at the kth signaling interval, ¢ is the 
carrier phase, and ng = Ake + jnxs is the noise vector. Similarly, the received signal vector 
at the output of the demodulator in the preceding signaling interval is the complex number 


Ye-1 = VS, eI OHAY + nga. (8.6.35). 


The decision variable for the phase detector is the phase difference between these two 
complex numbers. Equivalently, we can project y, onto yg- and use the phase of the 
resulting complex number, i.e., 


WYE = Bs FHM + SE AOAR a + (Be FO ny nni (8.6.36) 


which, in the absence of noise, yields the phase difference 6; — 6,—;. Thus, the mean value 
of y,yz_, is independent of the carrier phase. Differentially encoded PSK signaling that is 
demodulated and detected as just described is called differential PSK (DPSK). 
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Figure 8.52 Block diagram of a DPSK demodulator. 


The demodulation and detection of DPSK using matched filters is illustrated i 
Figure 8.52. If the pulse gr (t) is rectangular, the matched filters may be replaced by inte 
grators, which are also called integrate-and-dump filters. 


8.6.5 Probability of Error for DPSK 


Now consider the evaluation of the error probability performance of a DPSK demodulatc 
and detector. The derivation of the exact value of the probability of error for M -ary DPS] 
is extremely difficult, except for M = 2. The major difficulty is encountered in the determi 
nation of the PDF for the phase of the random variable y,y;_,, given by Equation (8.6.36 

Pawula et al. (1982) have shown that the probability of a symbol error for M-ar 
DPSK may be expressed in the integral form 


1 sed ps sin? (n/M) 
Pu = — ——— MM | do. 8.6.37 
X [ SSE l 1 + cos(x /M) cos 8 ( 


This expression can be computed numerically to yield the symbol error probability. 

An approximation to the performance of DPSK is easily obtained, as we now demor 
strate. 

Without loss of generality, suppose the phase difference 0 — 0,—1 = 0. Furthermor: 
the exponential factors e~/(-1-) and e/%-®) in Equation (8.6.36) can be absorbed int 
the Gaussian noise components ng—ı and ng, as shown in Problem 5.28, without changin 
their statistical properties. Therefore, y,y,_, in Equation (8.6.36) can be expressed as 


WY = Bs + VEs (nk + n1) + nng. (8.6.3: 


The complication in determining the PDF of the phase is the term nynj_,. However, i 
SNRs of practical interest, the term nyn;_, is small relative to the dominant noise teri 
VE5 (ne + nx_,). If we neglect the term nynj_, and we also normalize y,y¢_, by dividin 
through by /%,, the new set of decision metrics becomes 
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y= V65 + Re(nk + ný)» 


yo =Im(m + nž_)- (8.6.39) 


The variables y; and y, are uncorrelated Gaussian random variables with identical vari- 
ances o? = No. The phase is 
@ = tan! 2, (8.6.40) 

At this stage, we have a problem that is identical to the one we previously solved for phase- 
coherent demodulation and detection of PSK. The only difference is that the noise variance 
is now twice as large as in the case of PSK. Thus, we conclude that the performance of 
DPSK is 3 dB worse than that for PSK. This result is relatively good for M > 4, but it is 
pessimistic for M = 2 in the sense that the loss in binary DPSK, relative to binary PSK, is 
less than 3 dB at large SNR. 

In binary DPSK, the two possible transmitted phase differences are zero and v radi- 
ans. As a consequence, only the real part of y,y;_, is needed for recovering the informa- 
tion. We express the real part as 


1 
Re(yxyg_1) = z- + ye ye1)- (8.6.41) 


Because the phase difference between the two successive signaling intervals is zero, an 
error is made if Re(y, yý) is less than zero. A rigorous analysis of the error probability of 
DPSK based on computing the probability of y, yf_, + yg 9;,_, < 0 results in the following 
expression: 
P = ao (8.6.42) 

Here, pp = @,/ No is the SNR/bit. 

The graph of Equation (8.6.42) is shown in Figure 8.53. Also shown in this figure is 
the probability of error for binary PSK. We observe that, at error probabilities below 1074, 
the difference in SNR between binary PSK and binary DPSK is less than 1 dB. Since DPSK 
does not require the estimation of the carrier phase, the need for a phased-locked loop is 
eliminated, and the implementation of the demodulator is simplified. Given the relatively 
small difference in performance between binary DPSK and binary PSK, binary DPSK is 
often preferable in practice. 





= 8.7 QUADRATURE AMPLITUDE-MODULATED DIGITAL SIGNALS 


In our discussion of carrier-phase modulation, we observed that the bandpass signal wave- 
forms may be represented as given by Equation (8.6.4), in which the signal waveforms are 
viewed as two orthogonal carrier signals, cos 2v fet and sin 2x fet, which are amplitude 
modulated by the information bits. However, the carrier-phase modulation signal wave- 
forms are constrained to have equal energy s, which implies that the signal points in the 
geometric representation of the signal waveforms lie on a circle of radius @,. If we remove 
the constant energy restriction, we can construct two-dimensional signal waveforms whose 
vector representation is not constrained to fall on a circle. 
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f Figure 8.53 Probability of error for 
SNR/bit, dB binary PSK and DPSK. 


The easiest way to construct such signals is to impress separate information bits o 
each of the quadrature carriers, cos 2x fet and —sin 27 f,t. This type of digital modulatio 
is called quadrature amplitude modulation (QAM). We may view this method of info: 
mation transmission as a form of quadrature-carrier multiplexing, previously described i 
Section 3.4.2. 

The transmitted signal waveforms have the form 


Um(t) = Amc&r (t) cos 27 fet — Ams8r (t) sin2nfct, m=1,2,...,M, (8.7.1 


where {Amc} and {Ams} are the sets of amplitude levels that are obtained by mappin 
k-bit sequences into signal amplitudes. For example, Figure 8.54 illustrates a 16-QA} 
signal constellation that is obtained by amplitude modulating each quadrature carrier b 
M = 4 PAM. In general, rectangular signal constellations result when two quadratw 
carriers are each modulated by PAM. Figure 8.55 illustrates the functional block diagra 
of a modulator for QAM implementation. 

More generally, QAM may be viewed as a form of combined digital-amplituc 
and digital-phase modulation. Thus, the transmitted QAM signal waveforms may be expre 
sed as 


Umn(t) = Amgr(t) cos(2x fet + O,),m =1,2,...,Mi;n=1,2,..., M2. (8.7. 
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Figure 8.54 M = 16 QAM signal constellation. 
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Figure 8.55 Functional block diagram of a modulator for QAM. 


If Mı = 24 and M2 = 2*2, the combined amplitude- and phase-modulation method results 
in the simultaneous transmission of kj + k2 = log, Mı M2 binary digits occurring at a 
symbol rate Rp/(k; + k2). 


8.7.1 Geometric Representation of QAM Signals 


It is clear that the geometric signal representation of the signals given by Equations (8.7.1) 
and (8.7.2) is in terms of two-dimensional signal vectors of the form 


Sin = (VEs Ane, V8sAms) > M= 1, 2, Loses M, (8.7.3) 
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and the orthogonal basis functions are 


yit) = [eer cos 2n fet, 
1 : 
y(t) = -Jro sin 2n fot. 


It should be noted that M = 4 rectangular QAM and M = 4 PSK are identical sign 
constellations. 

Examples of signal-space constellations for QAM are shown in Figure 8.56. 

The average transmitted energy for those signal constellations is simply the sum c 
the average energies in each of the quadrature carriers. For the signal constellations, a 
shown in Figure 8.56, the average energy per symbol is given as 


(8.7.4 


1 M 
= 2 
Ea = 2 Isi I. (8.7.5 


The distance between any pair of signals points is 


dmn = V II Sm — Sn l2. (8.7.€ 


Example 8.7.1 
Consider the M = 8 rectangular signal constellation and the M = 8 combined PAM-PSK si; 
nal constellation, as shown in Figure 8.57(a) and (b). Assuming that adjacent signal points i 
each of the two signal constellations are separated by a distance d = 2, determine the averag 
energy per symbol for each constellation. Which signal constellation is preferable? Why? 


Solution For the M = 8 signal constellation shown in Figure 8.57(a), the coordinates of tt 
signal points are (+1, +1) and (+3, +1). Therefore, the average energy per symbol is 


Sa = Euo + 4(10)] = 6. 


For the M = 8 signal constellation shown in Figure 8.57(b), the coordinates of tł 
signal points are (+1, +1) and (1 + V3, 0), (—1 — v3, 0), (0, 1 + v3), and (0, —1 — v3 
Therefore, the average energy per symbol is 


Ca = 142) + 4(7.464)] = 4.73. 


This signal constellation is preferable because it requires 1 dB less energy per symbol 
achieve the same performance as that of the rectangular signal constellation. 
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Figure 8.56 (a) Rectangular signal-space constellations for QAM. (b, c) Examples of combined PAM-PSK 
signal-space constellations. 


8.7.2 Demodulation and Detection of QAM Signals 


The received QAM signal is corrupted by additive Gaussian noise. Hence, r(t) may be 
expressed as 


r(t) = Ame&r(t) cos(27 fet) — Ams 87 (t) sin(2 fot) + n(t). (8.7.7) 
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il Figure 8.57 Two M = 8 QAM signal point constellations. 


The received signal r(t) is cross correlated with the functions given by Equation 
(8.7.4), and the outputs of the cross correlators is sampled at time t = T to yield the vecto 


Y =Sm +n (8.7.8 
= (ve, Ame + Ne, Behn: + ns) . (8.7.9 


The optimum detector computes the distance metrics 




















DY, Sm) =|| Y — Sm Ñ, m=1,2,..., M (8.7.1C 


| and selects the signal corresponding to the smallest value of D(y, Sm). If a correlatio 
metric is used in place of a distance metric, it is important to recognize that correlatio 
metrics must employ bias correction because the QAM signals are not equal energy signal: 


8.7.3 Probability of Error for QAM 


it To determine the probability of error for QAM, we must specify the signal-point conste 
lation. We begin with QAM signal sets thathave M = 4 points. Figure 8.58 illustrates tw 
| 4-point signal sets. The first is a four-phase modulated signal and the second is a QA) 
ih signal with two amplitude levels, labeled ./$, and ./€ , and four phases. Because th 
iI probability of error is dominated by the minimum distance between pairs of signal point: 
we impose the condition that dmin = 2/8, for both signal constellations; we also evaluat 
ill the average energy, based on the premise that all signal points are equally probable. For th 
| four-phase signal, we have 



































Bay = 7(4)2B, = 2%,. (8.7.11 
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| d=V€, +8 =2V%, 
Ne €, = 8p 8 = 38, 


d= 2V, N 


(a) (b) 


Figure 8.58 Two 4-point signal constellations. 


For the two-amplitude four-phase QAM, we place the points on circles of radii V€; and 
Vp. Thus, with the constraint that dmin = 2/%, the average energy is 


1 
tae [288s + 28,] = 28s, (8.7.12) 


which is the same average energy as the M = 4-phase signal constellation. Hence, for all 
practical purposes, the error-rate performance of the two signal sets is the same. In other 
words, the two-amplitude QAM signal set has no advantage over M = 4-phase modulation. 

Next, consider M = 8 QAM. In this case, there are many possible signal constella- 
tions. The four signal constellations shown in Figure 8.59 consist of two amplitudes and 
have a minimum distance between signal points of 2/€;. The coordinates (Amc, Ams) for 
each signal point, normalized by /6,, are given in the figure. Assuming that the signal 
points are equally probable, the average transmitted signal energy is 


Se = (2, +a), (8.7.13) 


m=1 


where (amc, Ams) are the normalized coordinates of the signal points. 

In Figure 8.59, the two signal sets (a) and (c) contain signal points that fall on a 
rectangular grid and have Gay = 6€;. The signal set (b) requires an average transmitted 
signal av = 6.82@;, and the fourth requires av = 4.73@;. Therefore, to achieve the same 
probability of error, the signal set (d) requires approximately 1 dB less energy than (a) 
and (c) and 1.6 dB less energy than (b). This signal constellation is known to be the best 
eight-point QAM constellation because it requires the least energy for a given minimum 
distance between signal points. 

For M > 16, there are many more possibilities for selecting the QAM signal points 
in the two-dimensional space. Forexample, we may choose a circular multiamplitude con- 
stellation for M = 16, as shown in Figure 8.60. In this case, the signal points at a given 
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| | Figure 8.59 Four 8-point QAM signal constellations. 













































































Figure 8.60 Circular 16-point QAM signal 
M=16 constellation. 
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amplitude level are phase rotated by 1/4 relative to the signal points at adjacent ampli- 
tude levels. This 16-QAM constellation is a generalization of the optimum 8-QAM con- 
stellation. However, the circular 16-QAM constellation is not the best 16-QAM signal 
constellation for the AWGN channel. 

Rectangular QAM signal constellations have the distinct advantage of being easily 
generated as two PAM signals impressed on phase-quadrature carriers. In addition, they are 
easily demodulated as previously described. Although they are not the best M-ary QAM 
signal constellations for M > 16, the average transmitted energy required to achieve a 
given minimum distance is only slightly greater than the average energy required for the 
best M-ary QAM signal constellation. For these reasons, rectangular M-ary QAM signals 
are most frequently used in practice. 

For rectangular signal constellations in which M = 2* where k is even, the QAM 
signal constellation is equivalent to two PAM signals on quadrature carriers, each hav- 
ing VM = 2*/? signal points. Because the signals in the phase-quadrature components 
are perfectly separated by coherent detection, the probability of error for QAM is easily 
determined from the probability of error for PAM. Specifically, the probability of a correct 
decision for the M-ary QAM system is 


P, = (1— Py)’, (8.7.14) 


where P 7 is the probability of error of a ./M-ary with one-half the average energy in 
each quadrature signal of the equivalent QAM system. By appropriately modifying the 
probability of error for M -ary PAM, we obtain 


l 3 Say 
P—==2 eee ——— |, 8.7.1 
JM (1 =) Q MIM, (8.7.15) 


where @,,/No is the average SNR/symbol. Therefore, the probability of a symbol error for 
the M-ary QAM is 


Py =1—(1—Pyq)- (8.7.16) 


We note that this result is exact for M = 2* when k is even. On the other hand, when k 
is odd, the signal constellation is no longer square; therefore, there is no equivalent M-ary 
PAM system. This is no problem, however, because it is rather easy to determine the error 
rate for a rectangular signal set. If we employ the optimum detector that bases its decisions 
on the optimum distance metrics given by Equation (8.7.10), it is relatively straightforward 
to show that the symbol error probability is tightly upper bounded as 


(8.7.17) 
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for any k > 1, where Gbav/No is the average SNR/bit. The probability of a symbol error 
plotted in Figure 8.61 as a function of the average SNR/bit. 

It is interesting to compare the performance of QAM with that of phase modulatic 
for any given signal size M, because both types of signals are two-dimensional. Recall th 
Hh for M-ary phase modulation, the probability of a symbol error is approximated as 



































Py x20 ( Zp; sin =) (8.7.1 


where p, is the SNR/symbol. For M-ary QAM, we may use the expression in Equ 
tion (8.7.17). Because the error probability is dominated by the argument of the Q-functic 
we may simply compare the arguments of Q for the two signal formats. Thus, the ratio 
these two arguments is 





_ 3/(M—1) 


Zu = ——.. 
m 2 sin? x/M 


(8.7.1 
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TABLE 8.1 SNR ADVANTAGE OF 
M-ARY QAMOVER M-ARY PSK. 


M 10 logom 
8 1.65 

16 4.20 

32 7.02 

64 9.95 





For example, when M = 4, we have Æm = 1. Hence, 4-PSK and 4-QAM yield compa- 
rable performance for the same SNR/symbol. On the other hand, when M > 4, we find 
that m > 1; thus, M-ary QAM yields better performance than M-ary PSK. Table 8.1 
illustrates the SNR advantage of QAM over PSK for several values of M. For example, we 
observe that 32-QAM has a 7-dB SNR advantage over 32-PSK. 


Example 8.7.2 


From the expression for the symbol error probability of QAM, which is given by Equa- 
tion (8.7.17), determine the increase in the average energy per bit Cray required to maintain 
the same performance (error probability) if the number of bits per symbol is increased from k 
to k + 1 where k is large. 


Solution Since M = 2*, increasing k by one additional bit means that the number of signal 
points must be increased from M to 2M. For M large, we observe that increasing M to 2M 
in the argument of the Q-function in Equation (8.7.17) requires that we increase the average 
energy per bit from pav to 2Gbav; this allows the error probability to remain the same. There- 
fore, in QAM, increasing the number of bits per symbol by one bit causes an additional 3 dB 
increase in the transmitted signal energy to maintain the same error probability. This is a factor 
of two smaller in energy than PSK for large M. Therefore, QAM is more energy efficient than 
PSK for M > 4, as indicated in Table 8.1. a 





.8 CARRIER-PHASE ESTIMATION 


In this section, we treat methods for estimating the carrier phase that is required for phase- 
coherentdemodulation of the received signal. In particular, we consider methods for carrier- 
phase estimation from the received information-bearing modulated signal in which the data 
is conveyed by either PAM, PSK, or QAM. 


8.8.1 The Phase-Locked Loop 


We begin by describing a method for generating a phase reference for synchronous (phase- 
coherent) demodulation of a DSB-SC AM signal. The received noise-corrupted signal at 
the input to the demodulator is given by 


r(t) =u(t) + a(t) 
= A,m(t)cos(2n fet + $) +n(t), (8.8.1) 


where m(t) is the message signal, which is assumed to be a sample function of a zero-mean 
random process M (t), e.g., m(t) may be a baseband PAM signal. 
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First, we note that the received signal r (t) has a zero mean, since the message proces 
is zero mean, i.e., M(t) contains no DC component. Consequently, the average power ; 
the output of a narrowband filter tuned to the carrier frequency fe is zero. This fact implie 
that we cannot extract a carrier-signal component directly from r (t). 

If we square r(t), the squared signal contains a spectral component at twice th 
carrier frequency. That is, 


r(t) = A? M° (t) cos? (27 fst + Q) + noise terms 


1 1 
= z4M? (t) + 7AM) cos(4r fst + 2¢) 
+ noise terms. (8.8.2 


Since E(M?(t)) = Ry(0) > 0, there is signal power at the frequency 2 fz, which can t 
used to drive a phase-locked loop (PLL). 

In order to isolate the desired double-frequency component from the rest of the fr 
quency components, the squared input signal is passed through a narrowband filter that 
tuned to the frequency 2 f,. The mean value of the output of such a filter is a sinusoid wit 
frequency 2 f,, phase 2¢, and amplitude A2E (M?(t))H (2f.)/2, where H(2 fe) is the gai 
(attenuation) of the filter at f = 2f.. Thus, squaring the input signal produces a sinusoid 
component at twice the carrier frequency, which can be used as the input to a PLL. TI 
general configuration for the carrier-phase estimation system is illustrated in Figure 8.62 

The PLL consists of a multiplier, a loop filter, and a voltage-controlled oscillat: 
(VCO), as shown in Figure 8.63. If the input to the PLL is the sinusoid cos(4x fet + X% 
and the output of the VCO is sin(4nf,t + 2¢), where ĝ represents the estimate of œ, tl 
product of these two signals produces the signal 


elt) =cos(4nfet + 2¢) sin(4nfet +26) 
= ; sin2(¢ — ) + ; sin(8x fet + 26 +2¢). (8.8. 


Note that e(t) contains a low-frequency term (DC) and a term at four times the carrier. 


Band 
r(t) | Square-law aie “a PLL 
device tuned to 2f, 
| Figure 8.62 System for carrier-phi 


Output estimation. 





Input Loop 
signal filter 
VCO 


Figure 8.63 Basic elements of a PLL. 
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_ The loop filter is a lowpass filter that responds only to the low-frequency component 
sin(@ — $) and removes the component at 4 fe. This filter is usually selected to have the 
relatively simple transfer function 


1+ ts 


aey= 1+ts’ 





(8.8.4) 


where the time constants q; and tz are design parameters (tı >> t2) that control the band- 
width of the loop. A higher-order filter that contains additional poles may be used, if nec- 
essary, to obtain a better loop response. 

The output of the loop provides the control voltage for the VCO, whose implemen- 
tation is described in Section 4.3 in the context of FM modulation. The VCO is basically a 
sinusoidal signal generator with an instantaneous phase given by 


t 
4T fet +20 =4n fet +K / v(t)dt, (8.8.5) 
—00 
where K is a gain constant in radians per volt-second. Hence, the carrier-phase estimate at 
the output of the VCO is 


26 =K f i v(r)dt (8.8.6) 


and the wansfer function of the VCO is K/s. 

Since the double-frequency term resulting from the multiplication of the input signal 
to the loop with the output of the VCO is removed by the loop filter, the PLL may be 
represented by the closed-loop system model shown in Figure 8.64. 

The sine function of the phase difference 21¢—¢) makes the system nonlinear and, as 
a consequence, the analysis of the PLL performance in the presence of noise is somewhat 
involved, although it is mathematically tractable for simple loop filters. 

In a steady-state operation when the loop is tracking the phase of the received carrier, 
the phase error d — ¢ is small; hence, 


1 y i 
5 sing — 6) x$- 9. (8.8.7) 


With this approximation, the PLL is represented by the linear model shown in Figure 8.65. 
This linear model has a closed-loop transfer function 


KG(s)/s 


H(s) = (8.8.8) 


~ 1+ KG(s)/s° 





Figure 8.64 Model ofa PLL. 
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Loop 
oS filter 
~ G(s) 





K <—_—. 
s 


Figure 8.65 Linear model for a PLL. 


By substituting from Equation (8.8.4) for G(s) into Equation (8.8.8), we obtain 


= 1+ ts 
1+(tm2+%)s+ Hs? 











H(s) (8.8. 

Hence, the closed-loop system function for the linearized PLL is second order when tł 

loop filter has a single pole and a single zero. The parameter t) determines the position ı 

the zero in H (s), while K, t1, and T2 control the position of the closed-loop system pole 
The denominator of H (s) may be expressed in the standard form 


D(s) = 8 + 2fan5 + 02, (8.8.1 


where ¢ is called the loop-damping factor and w, is the natural frequency of the loop. ` 
terms of the loop parameters, w, = /K/t; and € = @,(t + 1/K)/2, the closed-loc 
transfer function becomes 


(26, — w?/K)s + o? 


H = 
(s) S? +2 wns +w? 


(8.8.1 
The magnitude response 20 log |H( jæ)| as a function of the normalized frequen: 
w/æwn is illustrated in Figure 8.66, with the damping factor as a parameter and tı >> 1. Nc 
that ¢ = 1 results in a critically damped loop response, ¢ < 1 produces an underdampı 
loop response, and ¢ > 1 yields an overdamped loop response. 
The (one-sided) noise-equivalent bandwidth of the loop is (see Problem 8.42) 
EE t2 (1/12 + Kin) _ 1+ mon)? (88.1 
4 (t aE x) 85 / On : 
In practice, the selection of the bandwidth of the PLL involves a trade-off between t 
speed of response and the noise in the phase estimate. On the one hand, we want to sele 
the bandwidth of the loop to be sufficiently wide in order to wack any time variations 
the phase of the received carrier. On the other hand, a wideband PLL allows more noise 
pass into the loop, which corrupts the phase estimate. Next, we assess the effects of noi 
in the quality of the phase estimate. 


Effect of Additive Noise on Phase Estimation. In order to evaluate the effe 
of noise on the estimate of the carrier phase, let us assume that the PLL is tracking 
sinusoid signal of the form 


S(t) = A, cos[2n fet + ¢(t)], (8.8.1 
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Figure 8.66 Frequency response of a second-order loop. 


which is corrupted by the additive narrowband noise 
n(t) = ne(t) cos 2r fet — ns(t) sin 2r fet. (8.8.14) 


The in-phase and quadrature components of the noise are assumed to be statistically inde- 
pendent, stationary Gaussian noise processes with (two-sided) power spectral density No/2 
W/Hz. By using simple trigonometric identities, the noise term in Equation (8.8.14) can be 
expressed as 


n(t) = x,(t) cos[2n fet + O(t)] — xs(t) sinf2n fet + ¢(t)), (8.8.15) 
where 
x(t) =n,(t) cos d(t) +ns(t) sin g(t) 
and 
x;(t) = —n,(t) sin ġ (t) + ns(t) cos P(t). (8.8.16) 
We note that 
Xe(t) + jxs(t) = [ne (t) + jns (te? (8.8.17) 


It is easy to verify that a phase shift does not change the first two moments of n,(t) and 
ns(t), so that the quadrature components x,(t) and xs(t) have exactly the same statistical 
characteristics as n,(t) and n;(t). (See Problem 5.28.) 
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Noise 
A,sinAd be 
VCO 





Figure 8.67 Equivalent mode 
of PLL with additive noise. 


Now, if s(t) + n(¢) is multiplied by the output of the VCO and the double frequenc: 
terms are neglected, the input to the loop filter is the noise-corrupted signal 


e(t) = Ac sin Ag + x,(t) sin Ad — x5(t) cos Ad, (8.8.18 


where, by definition, Ad = ĝ — ¢ġ is the phase error. Thus, we have the equivalent mode 
for the PLL with additive noise, as shown in Figure 8.67. 

When the power P, = A2/2 of the incoming signal is much larger than the nois 
power, the phase estimate ĝ ~ @. Then, we may linearize the PLL; thus, we can easil 
determine the effect of the additive noise on the quality of the estimate ĝ. Under thes 
conditions, the model for the linearized PLL with additive noise will appear as illustrate 
in Figure 8.68. Note that the gain parameter A, may be normalized to unity, provided th: 
the noise term is scaled by 1/A,. Thus, the noise term becomes 


zO j sin Ag - #8 zo = cos Ag. (8.8.16 





nı(t) = 


Since the noise n1(t) is additive at the input to the loop, the variance of the phas 
error Ad, which is also the variance of the VCO output phase, is 


= —— (8.8.2( 


where Bpeq is the (one-sided) equivalent-noise bandwidth of the loop, given by Equ: 
tion (8.8.12). Note that A2/2 is the power of the input sinusoid and o5 is simply the rat: 


Noise 


Figure 8.68 Linearized model 
of PLL with additive noise. 
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ofthe total noise power within the bandwidth of the PLL divided by the input signal power. 
Hence, 


Bae (8.8.21) 
ee 8. 
where pz is defined as the signal-to-noise ratio 
A?2/ 2 
= —s—_., 8.8.22 
PL Broa No/2 ( ) 


Thus, the variance of ĝ is inversely proportional to the SNR. 

The expression for the variance o? of the VCO phase error applies to the case where 
the SNR is sufficiently high so that the linear model for the PLL applies. An exact analysis 
based on the nonlinear PLL is mathematically tractable when G(s) = 1, which results in 
a first-order loop. In this case, the probability density function for the phase error can be 
derived [see Viterbi (1966)], and has the form 


exp(z cos Ad) 
2x 1o(pr) 


where pz is the SNR defined in Equation (8.8.22), Bneg is the appropriate noise-equivalent 

bandwidth of the first-order loop, and Jo(-) is the modified Bessel function of order zero. 
From the expression for f (A@), we may obtain the exact value of the variance o? 

for the phase error of a first-order PLL. This is plotted in Figure 8.69 as a function of 1/p,. 


f(Ad) = (8.8.23) 
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Also shown for comparison is the result obtained with the linearized PLL model. Note that 
the variance for the linear model is close to the exact variance for pz, > 3. Hence, the linear 
model is adequate for practical purposes. 

Approximate analysis of the statistical characteristics of the phase error for the non- 
linear PLL have also been performed. Of particular importance is the transient behavior of 
the nonlinear PLL during initial acquisition. Another important problem is the behavior of 
the PLL at low SNR. It is known, for example, that when the SNR at the input to the PLL 
drops below a certain value, there is a rapid deterioration in the performance of the PLL. 
The loop begins to lose lock and an impulsive type of noise, characterized as clicks, is 
generated; this degrades the performance of the loop. Results on these topics can be found 
in the texts by Viterbi (1966), Lindsey (1972), Lindsey and Simon (1973), and Gardnei 
(1979), and in the survey papers by Gupta (1975) and Lindsey and Chie (1981). 

Now that we have established the effect of noise on the performance of the PLL, let us 
return to the problem of carrier synchronization based on the system shown in Figure 8.70 
The squaring of the received signal that produces the frequency component at 2f, alsc 
enhances the noise power level at the input to the PLL; thus, it increases the variance o: 
the phase error. 

To elaborate on this point, let the input to the squarer be u(t) + n(t). The output is 


y(t) = u? (t) + 2u(t)n(t) + nA). (8.8.24 


The noise terms are 2u(t)n(t) and n?(t). By computing the autocorrelation and powe 
spectral density of these two noise components, we can show that both components haw 
spectral power in the frequency band centered at 2 f,. Consequently, the bandpass filter witl 
bandwidth Bneq centered at 2 fe, which produces the desired sinusoidal signal componen 
that drives the PLL, also passes noise due to these two noise terms. 


cos (4nf,t + 26) 















O Square-law 20) Bandpass 

s device s filter Loop 
(full-wave tuned to filter 
rectifier) 2fe — 






sin (4nf,t + 2) 


sin (2nf,t + $) a 
— +2 
Output to coherent 
demodulator Frequency 


divider 


Figure 8.70 Carrier recovery using a square-law device. 
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Let us select the bandwidth of the loop to be significantly smaller than the bandwidth 
Bop of the bandpass filter, so that the total noise spectrum at the input to the PLL may be 
approximated by a constant within the loop bandwidth. This approximation allows us to 
obtain a simple expression for the variance of the phase error as 
1 


2 
o; = —, (8.8.25 
$ PLSL ) 


where Sz is called the squaring loss and is given as 
1 


i+ niet 


SL = (8.8.26) 


Since S; < 1, we have an increase in the variance of the phase error; this is caused by 
the added noise power that results from the squaring operation. For example, when pz; = 
Bbp/2Bneq, the loss is 3 dB or, equivalently, the variance in the estimate increases by a 
factor of two. 

Finally, we observe that the output of the VCO from the squaring loop must be 
frequency divided by a factor of two and phase shifted by 90° to generate the carrier signal 
for demodulating the received signal. 


8.8.2 The Costas Loop 


A second method for generating a properly phased carrier for a double-sideband 
suppressed-carrier (DSB-SC) AM signal is illustrated by the block diagram in Figure 8.71. 


a | Lowpass | 
b r] | fter | 


cos (2n fot + ĝ) 


shift 


| 90° phase 





s(t) 


on e(t) 
| fiter | 





sin (2x fet + 4) 


re | | Lowpass | 
i | fter | 


Figure 8.71 Block diagram of Costas loop. 
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The received signal 
r(t) = A.m(t) cos(2n fet + $) +n(t) 


is multiplied by cos(2x fet + ĝ) and sin(2x fot + ĝ) , Which are outputs from the VCO. The 
two products are 
Ye(t) = [A-m(t) cos(2x fst + $) 
+n,(t) cos 27 fet — ns (t) sin 27 fet] cos(27 fet + ĝ) 
A 1 R x 
= zm) cos Ad + zl) cos ¢ + ns (t) sing] 
+ double-frequency terms, (8.8.27 
Ys (t) = [Acm(t) cos(21 ft + $) 
+n,(t)cos2n fet — ns (t) sin 27 fot] sin(27 fet + ĝ) 
A 1 z A 
= zm) sin Ad + zelt) sing — n, (t) cos $] 


+ double-frequency terms, (8.8.28. 


where Ad = Ê — ġ. The double-frequency terms are eliminated by the lowpass filter: 
following the multiplications. 

An error signal is generated by multiplying the two outputs y/(t) and yj{(t) of thr 
lowpass filters. Thus, 


elt) = yiyi) 


= Ae maq) sin2Ad 
+ fe mOnt) cos È + n, (t) sing] sin Ad 
g “m(t)ine(t) sin Ê — n, (t) cos ẹ] cos Ad 
+ Tinet) cos È + ns (t) sin$][n-(t) sing — ns (t) cos ĝ]. 


This error signal is filtered by the loop filter whose output is the control voltage that drive 
the VCO. 

We note that the error signal into theloop filter consists of the desired term (A2m? (t) 
4) sin2A¢@, plus terms that involve signal x noise and noise x noise. These terms ar 
similar to the two noise terms at the input of the PLL for the squaring method. In fact, i 
the loop filter in the Costas loop is identical to that used in the squaring loop, the two loop 
are equivalent. Under this condition, the probability density function of the phase error an 
the performance of the two loops are identical. 
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In conclusion, the squaring PLL and the Costas PLL are two practical methods for 
deriving a carrier-phase estimate for the phase-coherent demodulation of a DSB-SC AM 
signal. 


8.8.3 Carrier-Phase Estimation for PAM 


In the demodulation of the carrier-modulated PAM signal described in Section 8.5.2, we 
assumed that the basis function y(t) is perfectly synchronized with the signal component 
of r(t) in both time and carrier phase, as shown in Figure 8.72. In practice, however, these 
ideal conditions do not hold. First, the propagation delay encountered in transmitting a 
signal through the channel results in a carrier-phase offset in the received signal. Second, 
the oscillator that generates the carrier signal cos2/f,t at the receiver is not generally 
phase-locked to the oscillator used at the transmitter. Practical oscillators usually drift in 
frequency and phase. Consequently, the demodulation of the bandpass PAM signal, as 
illustrated in Figure 8.72, is ideal, but not practical. In a practical system, it is necessary 
to generate a phase-coherent carrier at the receiver to perform the demodulation of the 
received signal. The PLL or the Costas loop may be used for this purpose. 

A functional block diagram of the receiver that employs a PLL for estimating the 
carrier phase is shown in Figure 8.73. 


t To detector 
f ()dt |-——~| Sampler = 


V2 w(t) cos 2nf,t 
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Figure 8.72 Ideal 
Oscillator demodulation of bandpass 
digital PAM signal. 
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Figure 8.73 Demodulation of carrier-amplitude-modulated signal by using a PLL to acquire the carrier phase. 
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Output 
t data 
f ()dt |——>| Sampler |—>| Detector |———» 
0 


Signal-pulse 
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cos (2n fet + $) 
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Received 
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r() Bandpass data 


Sampler Detector 
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Figure 8.74 Bandpass demodulation of a digital PAM signal via (a) correlation and (b) matched filtering. 


As an alternative to performing the correlation or matched filtering at the basebanı 
as shown in Figure 8.73, we may perform cross correlation or matched filtering eithe 
at the bandpass or at some convenient intermediate frequency. In particular, a bandpas 
correlator may be used to multiply the received signal r(t) by the amplitude-modulate: 
carrier V24 (t) cos(2x fet + ĝ), where cos(2x fet + ĝ) is the output of the PLL. Th 
product signal is integrated over the signaling interval T, the output of the integrator i 
sampled at t = T, and the sample is passed to the detector. If we use a matched filte 
instead of a correlator, the filter impulse response is Y(T — t) cos[2x fe(T — t) + ĝ]. Th 
functional block diagrams for these demodulators are shown in Figure 8.74. 


8.8.4 Carrier-Phase Estimation for PSK 


As previously indicated, in any carrier modulation system, the oscillators employed at th 
transmitter and the receiver are not generally phase locked. As a consequence, the receive 
PSK signal will be of the form 


r(t) = Amcgr(t)cos(2x fet + Ø) — Amsgr(t) sin(2n fet + 6) +n(t), (8.8.26 
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where ¢ is the carrier-phase offset. This phase offset must be estimated at the receiver, and 
the phase estimate must be used in the demodulation of the received signal. Hence, the 
received signal must be correlated with the two orthogonal basis functions 


1 x 
WO=,/ gor (t) cos(2n fet + $) 
and (8.8.30) 


1 A 
plt)= -țar sin(2x fet + ), 


where ¢ is the estimate of the carrier phase obtained by a PLL, as shown in Fig- 
ure 8.75. When g7(t) is a rectangular pulse, the signal pulse generator may be eliminated. 

When the digital information is transmitted via the M-phase modulation of a carrier, 
a PLL may be used to estimate the carrier-phase offset. For M = 2, the squaring PLL and 
the Costas loop described in Sections 8.8.1 and 8.8.2 are directly applicable. 

For M > 2, the received signal may first be raised to the Mth power, as shown in 
Figure 8.76. Thus, if the received signal r(t) has the form 


r(t)= Sm(t) + n(t) 
(8.8.31) 
= &r(t) cos (242 +ọ+ =) + n(t), 


and we pass r(t) through an Mth power device, the output signal will contain harmonics 
of the carrier fe. The harmonic that we wish to select is cos(2nM fet + Mọ) for driving 
the PLL. We note that © 
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Figure 8.75 Demodulator for an M-ary phase-modulated signal. 
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Figure 8.76 Carrier-phase estimation for an M-ary PSK signal. 


2 
M (FF) =2nm =O(mod2n), m=1,2,...,M. (8.8.32 


Thus, the information is removed from the Mth harmonic. The bandpass filter tuned tı 
the frequency M fe produces the desired frequency component cos(2x M fet + Mo), whic! 
is driving the PLL. The VCO output is sinQQnMf,t + M Ê), so this output is divided i 
frequency by M to yield sin(2x fet +) and is phase shifted by 1/2 to yield cos(2x fot +6) 
The two quadrature-carrier components are then passed to the demodulator. 

We should note that the quadrature-phase carrier components generated as previous! 
described, contain phase ambiguities of multiples of 21 /M that result from multiplying th 
carrier phase ¢ by M. Because M¢(mod 2v) is less than 2x, dividing the resulting angl 
by M yields a phase estimate of |d| < 2x /M. However, the true carrier phase may excee 
this estimate by multiples of 21 /M, i.e., by 2mk/M, for k = 1, 2,..., M — 1. Such phas 
ambiguities can be overcome by differentially encoding the data at the transmitter an 
differentially decoding at the detector, as described in Section 8.6.4. 

Just as in the case of the squaring PLL, the Mth power PLL operates in the presenc 
of noise that has been enhanced by the Mth power-law device. The variance of the phas 
error in the PLL resulting from the additive noise may be expressed in the simple form 


1 


; (8.8.3: 
SMLPL 





2o 
Ca 


where pz is the loop SN R and Sm is the M-phase power loss. Sy. has been evaluated t 
Lindsey and Simon (1973) for M = 4 and M = 8. 

Another method for extracting a carrier-phase estimate ĝ from the received sign 
for M-ary phase modulation is the decision-feedback PLL (DFPLL), which is shown : 
Figure 8.77. The received signal is demodulated by using the two quadrature phase-locke 
carriers given by Equation (8.8.30) to yield y = (y1, y2) atthe sampling instants. The pha: 
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Figure 8.77 Carrier recovery for an M-ary PSK using a decision-feedback PLL. 


estimate Ê = tan™! y2/y1 is computed at the detector and quantized to the nearest of the 
M possible transmitted phases, which we denote as Ên. The two outputs of the quadrature 
multipliers are delayed by one symbol interval T and multiplied by cos Êm and — sin Ôn. 
Thus, we obtain 


—r (t) cos(27 fet + ĝ) sin Ôm = —Fler(s) COS Ôm + n-(t)] sin Om cos(ġ — $) 


+ slant) sin Om + ns(t)] sin Om sin(d — ẹ) 


+ double-frequency terms, 
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where we assume that 6 vn = Om, and 


—r (t) sin(2m fet + $) COS Om = slsr (t) COS Om + ne(t)] COS Om sin(d — ) 


i. isr Gjinia E mW cosy cost — 8) 


+ double-frequency terms. 


These two signals are added together to generate the error signal 
: > 1 ; 7 
e(t) = 38r (0) sinl — $) — znel) sinh — $ — Om) 


= Tnet) cos(¢ — ĝ — Om) (8.8.34 


+ double-frequency terms. 


This error signal is the input to the loop filter that provides the control signal for th 
VCO. 

We observe that the two quadrature noise components in Equation (8.8.34) appear a 
additive terms and no term involves a product of two noise components, as in the output o 
the Mth power-law device. Consequently, there is no power loss resulting from nonlinea 
operations on the received signal in the DFPLL. The M-phase decision-feedback wackin 
loop also has phase ambiguities of 2nk/M, necessitating the need for differentially encod 
ing the information sequence prior to transmission and differentially decoding the receive 
sequence at the detector to recover the information. 


8.8.5 Carrier-Phase Estimation for QAM 


As in the case of PAM and PSK, the demodulation of a QAM signal requires a carrie 
that is phase locked to the phase of the received carrier signal. Carrier-phase estimation fc 
QAM can be accomplished in a number of different ways depending on the signal-poir 
constellation and the phase relationships of the various signal points. 

For example, consider the eight-point QAM signal constellation shown in Figur 
8.56(b). The signal points in this constellation have one of two possible amplitude value 
and eight possible phases. The phases are spaced 45° apart. This phase symmetry allows t 
to use a PLL driven by the output of an 8th power-law device that generates carrier compc 
nents at 8 fe, where fe is the carrier frequency. Thus, the method illustrated in Figure 8.7 
may be generally used for any QAM signal constellation that contains signal points wit 
phases that are multiples of some phase angle 0, where LO = 360° for some integer L. 

Another method for extracting a carrier-phase estimate @ from the received M-ar 
QAM signal is the DFPLL previously described in Section 8.8.4. The basic idea in tk 
DFPLL is to estimate the phase of the QAM signal in each signal interval and remov 
the phase modulation from the carrier. The DFPLL may be used with any QAM signa 
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regardless of the phase relationships among the signal points. To be specific, we express 
the received QAM signal as 


r(t) = Am8r(t) cos(2 fet + On + Q) + n(t), (8.8.35) 


where 6, is the phase of the signal point and ¢ is the carrier phase. This signal is demodu- 
lated by cross correlating r (t) with y(t) and w(t), which are given by Equation (8.7.4). 
The sampled values at the output of the correlators are 


Yı = Am VEs cos(Gn +$ — $) + ne cosy + > — Ê) — ns sin(a +Ø- Ô) (9936 
Y2 = AmvVE;s sin, + 6 — $) + ne sin’ + $ — È) —nscos® + $ — 4). a 


Now suppose that the detector, which is based on yı and y2, has made the correct 
decision on the wansmitted signal point. Then we multiply yı by — sin 6, and y2 by cos 6n. 
Thus, we obtain 


—yı sin p= —Am/8s cos(6, + $ — ĝ) sin 6, + noise component 

= A, J/€,[— sin ðn cos 6, cosl — ĝ) + sin? 0, sin(ġ — ĝ) + noise component, 
y2 cos6, = AmJ/€; sin(6, + Q — ĝ) cos 8, + noise component 

= Am /&,[sin 0, cos, cos(d — ĝ) + cos? 6, sin(g — ĝ)] + noise component. 


(8.8.37) 
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Figure 8.78 Demodulation and detection of QAM signals. 
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Adding these two terms, we obtain an error signal e(T), given as 


e(T)= y2C0S On — yı Sin, 


= AnV&, sin(d — é) + noise components. (8.8.38) 


This error signal is now passed to the loop filter that drives the VCO. Thus, only the phase 
6, of the QAM signal is used to obtain an estimate of the carrier phase. Consequently, the 
general block diagram for the DFPLL, which is given in Figure 8.77, also applies to carrier. 
phase estimation for an M-ary QAM signal. By using this DFPLL or the PLL describec 
above, the QAM signal can be demodulated and detected as shown in Figure 8.78. 

As in the case of digitally phase-modulated signals, this method for carrier-phase 
recovery results in phase ambiguities. This problem is solved generally by differentia 
encoding of the data sequence at the input to the modulator. 


8.9 SYMBOL SYNCHRONIZATION 


In a communication system, the output of the receiving filter y(t) must be sampled peri 
odically at the symbol rate and at the precise sampling time instants tm = mT + to, wher 
T is the symbol interval and to is a nominal time delay that depends on the propagatio1 
time of the signal from the transmitter to the receiver. To perform this periodic sampling 
we require a clock signal at the receiver. The process of extracting such a clock signal a 
the receiver is usually called symbol synchronization or timing recovery. 

Timing recovery is one of the most critical functions performed at the receiver of ; 
synchronous digital communication system. We should note that the receiver must knov 
not only the frequency (1/ T) at which the outputs of the matched filters or correlators arı 
sampled, but also where to take the samples within each symbol interval. The choice o 
sampling instant within the symbol interval of duration T is called the timing phase. 

The best timing corresponds to the time instant within the symbol interval where thi 
signal output of the receiver filter is a maximum. In a practical communication system, thi 
receiver clock must be continuously adjusted in frequency (1/T) and in timing phase To t 
compensate for frequency drifts between the oscillators used in the transmitter and receive 
clocks; thus, it will optimize the sampling time instants of the matched filter or correlato 
outputs. 

Symbol synchronization can be accomplished in one of several ways. In some com 
munication systems, the transmitter and receiver clocks are synchronized to a master clock 
which provides a very precise timing signal. In this case, the receiver must estimate an 
compensate for the relative time delay between the transmitted and received signals. Suc] 
may be the case for radio communication systems where precise clock signals are trans 
mitted from a master radio station. 

Another method for achieving symbol synchronization is for the transmitter to simul 
taneously transmit the clock frequency 1/T or a multiple of 1/T along with the informa 
tion signal. The receiver may simply employ a narrowband filter tuned to the transmitte: 
clock frequency; thus, it can extract the clock signal for sampling. This approach has th 
advantage of being simple to implement. There are several disadvantages, however. On 
is that the transmitter must allocate some of its available power to the transmission of th 
clock signal. Another is that some small fraction of the available channel bandwidth mus 
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be allocated for the transmission of the clock signal. In spite of these disadvantages, this 
method is frequently used in telephone wansmission systems that employ large bandwidths 
to transmit the signals of many users. In such a case, the wansmission of a clock signal is 
shared in the demodulation of the signals among the many users. Through this shared use 
of the clock signal, the penalty in transmitter power and in bandwidth allocation is reduced 
proportionally by the number of users. 

A clock signal can also be extracted from the received data signal. There are a number 
of different methods that can be used at the receiver to achieve self-synchronization. Next, 
we will consider four approaches to the problem of achieving symbol synchronization from 
the received signal. 


8.9.1 Early-Late Gate Synchronizers 


One method for generating a symbol timing signal at the receiver exploits the symmetry 
properties of the signal at the output of the matched filter or correlator. To describe this 
method, let us consider the rectangular pulse s(t),0 < t < T, shown in Figure 8.79(a). 
The output of the filter matched to s(t) attains its maximum value at time t = T, as 
shown in Figure 8.79(b). Thus, the output of the matched filter is the time autocorrelation 
function of the pulse s(t). Of course, this statement holds for any arbitrary pulse shape, 
so the approach that we describe generally applies to any signal pulse. Clearly, the proper 
time to sample the output of the matched filter for a maximum output is at t = T, i.e., at 
the peak of the correlation function. 

In the presence of noise, the identification of the peak value of the signal is gene- 
rally difficult. Instead of sampling the signal at the peak, suppose we sample early 
(att =T — ôT) and late (at t = T + ôT). The absolute values of the early samples 


s(t) 








A 


i Matched filter output 
Optimum sample 


Early sample Late sample 





EZT POPREL 2E Figure 8.79 (a) Rectangular signal pulse and (b) its 
(b) matched filter output. 
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|y[m(T — 5T)]| and the late samples | y[m(T +6T )]| will be smaller (on the average in the 
presence of noise) than the samples of the peak value |y(mT) |. Since the autocorrelation 
function is even with respect to the optimum sampling time t = T, the absolute values of 
the correlation function at t = T — ôT and t = T + ôT are equal. Under this condition, the 
proper sampling time is the midpoint between t = T — ôT and t = T + ôT. This condition 
forms the basis for the early—late gate symbol synchronizer. 

Figure 8.80 illustrates the block diagram of an early—late gate synchronizer. In this 
figure, correlators are used in place of the equivalent matched filters. The two correlators 
integrate over the symbol interval T, but one correlator starts integrating ôT early relative 
to the estimated optimum sampling time, and the other integrator starts integrating 5T late 
relative to the estimated optimum sampling time. An error signal is formed by taking the 
difference between the absolute values of the two correlator outputs. To smooth the noise 
corrupting the signal samples, the error signal is passed through a lowpass filter. If the 
timing is off relative to the optimum sampling time, the average error signal at the output of 
the lowpass filter is nonzero, and the clock signal is either retarded or advanced, depending 
on the sign of the error. Thus, the smoothed error signal is used to drive a voltage-controlled 
oscillator, whose output is the desired clock signal that is used for sampling. The output of 
the VCO is also used as a clock signal for a symbol waveform generator, which puts out 
the same basic pulse waveform as the transmitting filter. This pulse waveform is advanced 
and delayed, and then it is fed to the two correlators, as shown in Figure 8.80. Note that 
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Figure 8.80 Block diagram of early—late gate synchronizer. 
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if the signal pulses are rectangular, there is no need for a signal pulse generator within the 
tracking loop. 

We observe that the early—late gate synchronizer is basically a closed-loop control 
system whose bandwidth is relatively narrow compared to the symbol rate 1/7. The band- 
width of the loop determines the quality of the timing estimate. A narrowband loop pro- 
vides more averaging over the additive noise; thus, it improves the quality of the estimated 
sampling instants, provided that the channel propagation delay is constant and the clock 
oscillator at the transmitter is not drifting with time (drifting very slowly with time). On the 
other hand, if the channel propagation delay is changing with time or the transmitter clock 
is also drifting with time, then the bandwidth of the loop must be increased to provide for 
faster tracking of time variations in symbol timing. This increases the noise in the loop and 
degrades the quality of the timing estimate. 

In the tracking mode, the two correlators are affected by adjacent symbols. However, 
if the sequence of information symbols has zero mean, as is the case for PAM and some 
other signal modulations, the contribution to the output of the correlators from adjacent 
symbols averages out to zero in the lowpass filter. 

An equivalent realization of the early—late gate synchronizer that is somewhat easier 
to implement is shown in Figure 8.81. In this case, the clock from the VCO is advanced 
and delayed by ôT, and these clock signals are used to sample the outputs of the two 
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Figure 8.81 Block diagram of early—late gate synchronizer—an alternative form. 
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8.9.2 Minimum Mean Square Error Method 


Another approach to the problem of timing recovery from the received signal is basex 
on the minimization of the mean square error (MSE) between the samples at the outpu 
of the receiver filter and the desired symbols. We assume that the baseband signal at thy 
transmitter is of the form 


v(t) = D angr(t — nT), (8.9.1 


n=—00 


where {a,,} is the data sequence and T is the symbol interval. To be specific, let us assum: 
that v(t) is a PAM baseband signal and the data sequence {an} is a zero mean, stationar 
sequence with statistically independent and identically distributed elements. Therefore, th 
signal v(t) has zero mean, i.e., E(v(t)) = 0. Furthermore, the autocorrelation function o 
u(t) is periodic in T; hence, v(t) is a periodically stationary process. 

The received signal at the output of the matched filter at the receiver may be expresse 
as 


y(t) = J ayx(t — nT — To) + w(t), (8.9.2 


n=—oo 


where x(t) = gr(t)*&r(t), the asterisk denotes convolution, gr (t) isthe impulse respons: 
of the receiver filter, w(t) represents the noise at the output of the receiver filter, and t 
(To < T) represents the timing phase. 

The MSE between the output of the receiver filter and the desired symbol at the mt 
symbol interval is defined as 


MSE = E{[ym (to) — aml}, (8.9.3 
where 
Ym(to.) = J` ayx(mT — nT — 1) + w(mT). (8.9.4 


Since the desired symbol a,, is not known a priori at the receiver, we may use the output c 
the detector, denoted as âm, for the mth symbol. Thus, we substitute â, for am in the MS! 
expression. Hence, the MSE is redefined as 


MSE = E{[ym(to) — Gm}. (8.9.8 


The minimum of MSE with respect to the timing phase To is found by differentiatin 
Equation (8.9.5) with respect to to. Thus, we obtain the necessary condition 


a 7 dYm(T0) 

Y [ym(t0) — ân] 22 = 0, (8.9.¢ 
d To 

m 

An interpretation of the necessary condition in Equation (8.9.6) is that the optimu: 

sampling time corresponds to the condition that the error signal [y(t) — âm] is uncorre 


lated to the derivative dy (To) /dt. Since the detector output is used in the formation ¢ 
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the error signal [ym (To) — âm], this timing phase-estimation method is said to be decision 
directed. 

Figure 8.82 illustrates an implementation of the system that is based on the condition 
given in Equation (8.9.6). Note that the summation operation is implemented as a lowpass 
filter, which averages a number of symbols. The averaging time is roughly equal to the 
reciprocal of the bandwidth of the filter. The filter output drives the voltage-controlled 
oscillator, which provides the best MSE estimate of the timing phase To. 


8.9.3 Maximum-Likelihood Method 


In the ML method, the optimum symbol timing is obtained by maximizing the likelihood 
function, defined as 


A(t) = È a@mYm (To); (8.9.7) 


where y,,(To) is the sampled output of the receiving filter given by Equation (8.9.4). From 
a mathematical viewpoint, the likelihood function can be shown to be proportional to the 
probability of the received signal (vector) conditioned on a known transmitted signal. Phys- 
ically, A (To) is simply the output of the matched filter or correlator at the receiver averaged 
over a number of symbols. 
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Figure 8.82 Timing recovery based on minimization of MSE. 
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Figure 8.83 Decision-directed ML timing recovery method for baseband PAM. 


A necessary condition for To to be the ML estimate is that 


(8.9.8 





dA(t%) _ dYm(To) _ 
= D an = 


dt 


This result suggests the implementation of the tracking loop shown in Figure 8.83. Wi 
observe that the product of the detector output âm with dym(to)/dt is averaged by a low 
pass filter that drives the VCO. Since the detector output is used in the estimation method 
the estimate Zo is decision directed. 

As an alternative to using the output symbols from the detector, we may use : 
nondecision-directed method that does not require knowledge of the information symbols 
This method is based on averaging over the statistics of the symbols. For example, we ma 
square the output of the receiving filter and maximize the function 


A(t) = > ya) (8.9.9 
with respect to to. Thus, we obtain 
d A(T) dym (To) 
—— =2 m(T) ——— = 0. 8.9.10 
dt 2 DO ae ( 


The condition for the optimum 79 given by Equation (8.9.10) may be satisfied b 
the implementation shown in Figure 8.84. In this case, there is no need to know the dat 
sequence {am}. Hence, the method is nondecision directed. 


8.9.4 Spectral-Line Method 
Since the signal component at the output of the receiver filter is periodic with period 7 


we can recover a clock signal with frequency 1/T by filtering out a signal component : 
f = 1/T. We observe, however, that E(y(t)) = 0 because E(a,) = 0. Therefore, y(t 
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Figure 8.84 Nondecision-directed estimation of timing for baseband PAM. 


cannot be used directly to generate a frequency component at f = 1/7. On the other 
hand, we may perform a nonlinear operation on y(t) to generate power at f = 1/T and its 
harmonics. 

Let us consider a square-law nonlinearity. If we square the signal y(t) given by 
Equation (8.9.2) and take the expected value with respect to the data sequence {an}, we 
obtain 


E(y*(t)) =E (x 5 anamx(t — mT — t)x(t — nT — m) + noise component 


n m 


[>e] 
= ož D x? (t —nT — To) + noise component, (8.9.11) 


n=—00 


where ož = E (a?). Since E (y?(t)) > 0, we may use y?(t) to generate the desired fre- 
quency component. 

Let us use Poisson’s sum formula on the signal component (see Problem P-2.56) to 
express Equation (8.9.11) in the form of a Fourier series. Hence, 


2 
o : 
ož Yoxa — nT —%) = T Xo cn el2nm(t—t0)/T (8.9.12) 
n n 


where 


Cm = [ rox (+ z f) df. (8.9.13) 


By design, we assume that the transmitted signal spectrum is confined to frequencies below 
1/T. Hence, X(f) = 0 for |f| > 1/T; consequently, there are only three nonzero terms 
(m = 0, +1) in Equation (8.9.12). Therefore, the square of the signal component contains 
a DC component and a component at the frequency 1/T. 
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This development suggests that we square the signal y(t) at the output of the receiv 
ing filter and filter y? (t) with a narrowband filter B( f) tuned to the symbol rate 1/7. If w 
set the filter response B(1/T) = 1, then 


2 ; 2 2 
eRe [c ei2m(t—to)/T ] = = cos Se — To), (8.9.14 


so that the timing signalis a sinusoid with a phase of —2 7 to/ T , assuming that X ( f) is rea] 
We may use alternate zero crossings of the timing signal as an indication of the correct sam 
pling times. However, the alternate zero crossings of the signal given by Equation (8.9.14 
occur at 


2x T 
7€ — To) = (4k + D5: (8.9.15 
or equivalently, at 


T 
t=kT+%+ T (8.9.16 


which is offset in time by T/4 relative to the desired zero crossings. In a practical syster 
the timing offset can be easily compensated either by relatively simple clock circuitry o 
by designing the bandpass filter B(f) to have a x/2 phase shift at f = 1/T. Figure 8.8. 
illustrates this method for generating a timing signal at the receiver. 

The additive noise that corrupts the signal will generally cause fluctuations in th 
zero crossings of the desired signal. The effect of the fluctuations will depend on th 
amplitude cı of the mean timing sinusoidal signal given by Equation (8.9.14). We not 
that the signal amplitude c; is proportional to the slope of the timing signal in the vicinit 
of the zero crossing, as shown in Figure 8.86. Therefore, when the amplitude c; is large 
the slope will be larger, consequently, the timing errors due to the noise will be smalle 
From Equation (8.9.13), we observe that cı depends on the amount of spectral overlap c 
X(f) and X(1/T — f). Thus, cı depends on the amount by which the bandwidth of X (f 
exceeds the (Nyquist) bandwidth 1/2T. In other words, cı depends on the excess banc 
width of the signal, which is defined as the band of frequencies of the signal X(f) beyon 
f =1/2T. If the excess bandwidth is zero, i.e., X(f) = 0 for |f| > 1/2T, then c, = ( 
thus, this method fails to provide a timing signal. If the excess bandwidth is large, such a 
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Figure 8.86 Illustration of the slope of 
the sinusoid at the zero crossing as a 
function of the amplitude. 


a/2T where œ = 1/2 or 1, the timing signal amplitude will be sufficiently large to yield 
relatively accurate symbol timing estimates. 


8.9.5 Symbol Synchronization for Carrier-Modulated Signals 


The symbol-timing synchronization methods described above for baseband signals also 
apply to bandpass signals. Because any carrier-modulated signal can be converted to a 
baseband signal by a simple frequency translation, symbol timing can be recovered from 
the received signal after the frequency conversion to baseband. 

For QAM signals, the spectral-line methods described in Section 8.9.4 have proven 
to be particularly suitable for timing recovery. Figure 8.87 illustrates a spectral-line method 
that is based on filtering out a signal component at the frequency 1/2T and squaring the 
filter output to generate a sinusoidal signal at the desired symbol rate 1/ T. Because the 
demodulation of the QAM signal is accomplished as previously described [by multipli- 
cation of the input signal with the two quadrature-carrier signals y(t) and w(t)], the 
in-phase and quadrature signal components at the outputs of the two correlators are used 
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Figure 8.87 Block diagram of a timing recovery method for QAM. 
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as the inputs to the two bandpass filters tuned to 1/27. The two filter outputs are square; 
(rectified), summed, and then filtered by a narrowband filter tuned to the clock frequenc: 
1/T. Thus, we generate a sinusoidal signal that is the appropriate clock signal for samplin; 
the outputs of the correlators to recover the information. 

In many modern communication systems, the received signal is processed (demod 
ulated) digitally after it has been sampled at the Nyquist rate or faster. In these cases, th 
symbol timing and carrier phase are recovered by signal-processing operations performe 
on the signal samples. Thus, a PLL for carrier recovery is implemented as a digital PLL 
and the clock recovery loop of a type described in this section is also implemented as 
digital loop. References on timing recovery methods based on sampled signals are given i 
Section 8.11. 


8.10 REGENERATIVE REPEATERS 


In Section 6.4.4, we described analog repeaters, which are amplifiers that are generall 
used in telephone wireline channels and microwave line-of-sight radio channels to boo: 
the signal level and, thus, to offset the effect of signal attenuation in transmission throug 
the channel. In this section, we treat regenerative repeaters, which are generally used i 
digital transmission systems. 

The front end of each regenerative repeater consists of a demodulator/detector thi 
demodulates and detects the transmitted digital information sequence sent by the precedin 
repeater. Once detected, the sequence is passed to the transmitter side of the repeater, whic 
maps the sequence into signal waveforms that are transmitted to the next repeater. This tyg 
of repeater is called a regenerative repeater. 

Since a noise-free signal is regenerated at each repeater, the additive noise does n 
accumulate. However, when errors occur in the detector of a repeater, the errors are prop: 
gated forward to the following repeaters in the channel. To evaluate the effect of errors c 
the performance of the overall system, suppose that the modulation is binary PAM, so th: 
the probability of a bit error for one hop (signal transmission from one repeater to the ne: 
repeater in the chain) is 


Since errors occur with low probability, we may ignore the probability that any o1 
bit will be detected incorrectly more than once during the transmission through a chann: 
with K repeaters. Consequently, the number of errors will increase linearly with the nun 
ber of regenerative repeaters in the channel, and thus the overall probability of error mz 
be approximated as 
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In contrast, the use of K analog repeaters in the channel reduces the received SNR by K; 
hence, the bit-error probability is 


TA A ). (8.10.2) 


Clearly, for the same probability of error performance, the use of regenerative repeat- 
ers results in a significant savings in transmitter power over analog repeaters. Hence, in 
digital communication systems, regenerative repeaters are preferable. However, in wire- 
line telephone channels that are used to transmit both analog and digital signals, analog 
repeaters are generally employed. 


Example 8.10.1 


A binary digital communication system transmits data over a wireline channel of length 1000 
kilometers. Repeaters are used every 10 kilometers to offset the effect of channel attenuation. 
Determine the 6,/No that is required to achieve a probability of a bit error of 1075 if (a) 
analog repeaters are employed and (b) regenerative repeaters are employed. 


Solution The numter of repeaters used in the system is K = 100. If regenerative repeaters 
are used, the ,/No obtained from Equation (8.10.1) is 


which yields approximately 11.3 dB. If analog repeaters are used, the 6,/No obtained from 
Equation (8.10.2) is 


26s 
100M0 }’ 





10$ = Q 


which yields an ,/No of 29.6 dB. Hence, the difference on the required SNR is about 18.3 
dB, or approximately 70 times the transmitter power of the digital communication system. 
n 





4 8.11 SUMMARY AND FURTHER READING 


In this chapter, we introduced the reader to basic digital modulation methods for trans- 
mitting digital information on physical communication channels in which the transmitted 
signal is corrupted by additive noise that is modeled as white and Gaussian (AWGN). 
We began by developing a geometric representation of digital signals as vectors. Such a 
representation provides acompact characterization of signals for transmitting digital infor- 
mation over a channel, and it simplifies the analysis of their performance. 

Using the vector representation, we classified different signals by their dimensional- 
ity. Signals that are represented as a one-dimensional vector (scalar) are binary antipodal 
signals (binary PAM) and M-ary (nonbinary) pulse amplitude modulation (M-ary PAM). 
















































































458 Digital Modulation in AWGN Channel Chapter ¢ 


These one-dimensional signals can be transmitted either on a baseband channel, or on ; 
bandpass channel by simply impressing the digital signal onto a carrier frequency. 

We also introduced digital signals whose geometric representation took the form oj 
a two-dimensional vector. Examples of two-dimensional signals are binary PPM, binary 
FSK, PSK, DPSK, and QAM. Binary PPM is suitable for transmssion on baseband chan. 
nels and, by using the PPM signal to amplitude modulate a carrier frequency, the result: 
ing signal can be transmitted on a bandpass channel. In contrast, the binary FSK, PSK 
DPSK, and QAM are bandpass signals and, hence, are suitable for transmission on band 
pass channels. , 

A major emphasis of this chapter was placed on deriving the signal processing tha 
is performed by the receiver, whose function is to recover the transmitted digital informa 
tion as reliably as possible in the presence of AWGN. The receiver processing was spli 
into two parts, the first being the demodulator, which is then followed by the detector. W 
demonstrated that the demodulator can be implemented either as a cross correlator or a: 
a matched filter. The outputs of the demodulator are sampled at the bit rate (for binan 
signals) or at the symbol rate (for M-ary signals) and fed to the detector, which decide; 
on the most probably transmitted symbol. We observed that the optimum decision rul 
used by the detector is based on the maximum a posteriori probability (MAP) criterion o 
the maximum-likelihood (ML) criterion, depending on whether tie digital signals that ar 
transmitted have unequal or equal a priori probabilities of occurring. Based on this imple 
mentation of the optimum detector, we derived the probability of error for the differen 
signal types, i.e., PAM, PSK, DPSK, QAM, and binary FSK. 

In the transmission of carrier modulated signals, such as PAM, PSK, and QAM, w 
observed that it is necessary to estimate the phase of the received carrier signal in orde 
to perform phase coherent demodulation and, thus, to recover the transmitted informa 
tion. Methods for estimating the carrier phase from the incoming received PAM, PSK 
or QAM signals were described, using as a basic element the phase-locked-loop (PLL; 
whose operating characteristics were described in Section 8.8.1. 

In this chapter, we also treated the important problem of extracting a timing signz 
from the received signal, which serves as a clock for periodically sampling the output o 
the demodulator at the symbol rate. 

The geometrical representation of digital signals as vectors was first used by Kotel 
nikov (1947) and Shannon (1948a, 1948b) (in his classic papers). This approach was por 
ularized by Wozencraft and Jacobs (1965). Today, this approach to signal analysis an 
design is widely used. Treatments similar to those given in the text may be found in mo: 
books on digital communications. 

The matched filter was introduced by North (1943), who showed that it maximize 
the SNR. Analyses of various binary and M-ary modulation signals in AWGN were pe! 
formed in the two decades following Shannon’s work. Treatments similar to those given i 
the chapter may be found in most books on digital communications. 

A number of books and tutorial papers have been published on the topic of tim 
synchronization. Books that cover both carrier-phase recovery and time synchronizatio 
have been written by Stiffler (1971), Lindsey (1972), Lindsey and Simon (1973), Meyr an 
Ascheid (1990), and Mengali and D’ Andrea (1997). The tutorial paper by Franks (198( 
presents a very readable introduction to this topic. 
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8.1 Consider the three waveforms Wy, (t) shown in Figure P-8.1. 


1. Show that these waveforms are orthonormal. 


2. Express the waveform x(t) as a weighted linear combination of yf, (t),n = 
1, 2, 3 if 


-1, O<t<l 
x(t) = 1 1<t<3 
=, <4 


and determine the weighting coefficients. 





WO a(t) 
1 1 
2 2 
0 2 4 t o 4 t 
pt) 
1 er 
2 á 
0 1 2 3 i t 
_1)_ 
2 
Figure P-8.1 


8.2 Use the orthonormal waveforms in Figure P-8.1 to approximate the function 


x (t) = sin (nt/4) 
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over the interval 0 < t < 4 by the linear combination 


3 
EO) =D etn O. 


n=1 


1. Determine the expansion coefficients {c,} that minimize the mean square approx. 
imation error f g 


E= A [x@ -i OF at. 
0 
2. Determine the residual mean square error Emin. 
8.3 Consider the four waveforms shown in Figure P-8.3. 


1. Determine the dimensionality of the waveforms and a set of basis functions. 
2. Use the basis functions to represent the four waveforms by vectors S1, $2, 83, S4 
3. Determine the minimum distance between any pair of vectors. 


s,(t) | 53(4) 
2 





Figure P-8.3 
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8.4 Determine a set of orthonormal functions for the four signals shown in Figure P-8.4 
You may determine these functions by inspection. 


SG) 52(0) 


DEE 2 
l | = t » 
: 0 1 2 3 1 


53(t) s4(t) 





Figure P-8.4 


8.5 Show that if fe = where k is a positive integer, then the energy in each of the 
signals sı (t) and s2(t), given by Equation (8.2.5), is equal to €. Also show that if 
the condition fs = a is not satisfied but f-Tp >> 1, then the energy in sı (t) and 
S(t) can be very closely approximated by €p. Note that since T, is the duration 
of the rectangular pulse, 1/7; approximates its bandwidth. Therefore, the condition 
fceTo >> 1 means that the carrier frequency fe is much larger than the bandwidth of 


the rectangular baseband pulse, a condition that is usually satisfied. 


(Eshon that if fi = a and fp = a where kı and kz are distinct positive integers, 
then the energy in each of the signals sı (t£) and s2(t), given by Equation (8.2.15), 
is equal to p, and furthermore, they are orthogonal. Also show that if the above 
conditions are not satisfied but fiT, >> 1 and fT, > 1, then the energy in s(t) 
and s(t) can be very closely approximated by @», and the inner product of sı (t) and 
52(t) is very close to zero. Note that since T, is the duration of the rectangular pulse, 
1/T, approximates its bandwidth. Therefore, the conditions fı 7, >> land hT, > 1 
mean that the carrier frequencies fı and fz are_much larger than the bandwidth of 

the rectangular baseband pulse, a condition that is usually satisfied. 


8.7 The received signal in a binary communication system that employs antipodal signals 
is 
r@=s(t)+n(t), 
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where s(t) is shown in Figure P-8.7 and n(t) is AWGN with power spectral density 
No/2 W/Hz. 


1. Sketch the impulse responses of the filter matched to s(t). 
2. Sketch the output of the matched filter to the input s (t). 
3. Determine the variance of the noise of the output of the matched filter at t = 3. 


4. Determine the probability of error as a function of A and No. 


s(t) 


=- { 
0 1 2 3 Figure P-8.7 


8.8 A digital communication system employs the four equiprobable signals given below 


1 O<t<l 
s(t) ={-—l1 1<t<2, 
0 otherwise 


2 0<tż<1 
s(t)={(1 1<t<3, 
0 otherwise 


s3(t) = —sı (t) 


kOe 2 2<t<3 
ae, 0 otherwise ` 


1. Determine an orthonormal basis for representation of these signals. 
2. Draw the constellation representing these signals. 
3. What is the dimensionality of this signal set? 


4. Using the vector representation of these signals determine the average energy : 
this signal set and the average energy per bit Ebay. 
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8.9 A matched filter has the frequency response 
i j2nfT 
j2nf 


1. Determine the impulse response h (t) corresponding to H (f). 


H(f)= 


2. Determine the signal waveform to which the filter characteristic is matched. 


8.10 Prove that when a sinc pulse gr (t) is passed through its matched filter, the output is 
the same sinc pulse. 


8.11 The demodulation of the binary antipodal signals 


T? 0 <t< T 
si(t) = —s2(t) = 
0, otherwise 
can be accomplished by use of a single integrator, as shown in Figure P-8.11, which 
is sampled periodically at t = kT, k = 0, +1, +2,.... The additive noise is zero- 
mean Gaussian with power spectral density of *o W/Hz. 


1. Determine the output SNR of the demodulator at t = T. 


2. If the ideal integrator is replaced by the RC filter shown in Figure P-8.11, deter- 
mine the output SNR as a function of the time constant RC. 


3. Determine the value of RC that maximizes the output SNR. 


Output 
r(t) =s,(t) + n(t decision 
t=kT 
R 
o—\W\4 0 


"T t=kT+r 1<<T 


Figure P-8.11 


Sketch the impulse responses of the filters matched to the pulses shown in Figure 
P-8.12. Also determine and sketch the output of each of the matched filters. 
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54(2) 52(t) 
1 
t 
0 T 
(a) 

53(t) 

2 

0 T T a 

2 
(c) 
Figure P-8.12 


8.13 A binary PAM communication system employs rectangular pulses of duration < 
and amplitudes +A to transmit digital information at a rate R = 10° bits/sec. If tl 
power spectral density of the additive Gaussian noise is No/2, where No = 10° 
W/Hz, determine the value of A that is required to achieve an error probability | 
Pa = 1076. 


8.14 In a binary PAM communication system fok which the two signals occur with unequ 
probabilities (p and 1 — p), the opein ‘detector compares the output of the corr 
lator, which takes the values 

y= - +/6, +n 


with a threshold œ, which is given by Equation (8.3.56). The resulting error prob 
bility of the detector is given by Equation (8.3.53). 


1. Determine the value of the threshold a = a* for p = 0.3, 6, = 1 and No = 0. 
2. Determine the average probability of errorwhena = a*. 


8.15 Suppose that two signal waveforms sı (t) and sz (t) are orthogonal over the inter 
(0, T). A sample function n (t) of a zero mean, white noise process is cross correlat 
with sı (£) and s2 (t) to yield : 


T 
ny =f sı (t) n (t)dt 
0 
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and 


T 
n = Í s (t)n (t)dt. 
0 
Prove that E (njn2) = 0. 
A binary PAM communication system is used to transmit data over an AWGN chan- 
” nel. The prior probabilittes for the bits are P (am = 1) = 1/3 and P (am 


=-1l)= 
2/3. 


ete NS A 


i 
; | & 
1. Determine the optimum threshold at the detector. ae 
T 


2. Determine the average probability of error. 


3. Evaluate the average probability of error when €, = 1 and No = 0.1. 


8.17 In a binary antipodal signaling scheme, the signals are given by 


2A T 
T. 0<t<3 

s(t) = s(t) = {2A (1-4), Eset <T. 
0, is otherwise 


r 


P 
The channel is AWGN and S, (f) = Mo The two signals have prior probabilities p 
and 1 — p. 

1. Determine the structure of the optimal receiver. 

2. Determine an expression for the error probability. 


3. Plot the error probability as a function of p for0 < p < 1. 


8.18 In an additive white Gaussian noise channel with noise power spectral density of ‘a, 
two equiprobable messages are transmitted by 


A 0<t<T 
HOS Os 
0, otherwise 


ae A(1-— 4), O0<t<T 
AS , otherwise ` 


1. Determine the structure of the optimal receiver. 


2. Determine the probability of error. 


8.19 Suppose that binary PAM is used for transmitting information over an AWGN with 
power spectral density of No/2 = 10~!° W/Hz. The transmitted signal energy is 
Eb = A?T/2, where T is the bit interval and A is the signal amplitude. Determine 
the signal amplitude required to achieve an error probability of 1076 if the data rate 
is (a)10 kbps, (b)100 kbps, and (c) 1 Mbps. 
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8.20 A Manchester encoder maps an information 1 into 10 and a 0 into 01. The sig- 
nal waveforms corresponding to the Manchester code are shown in Figure P-8.20, 


Determine the probability of error if the two signals are equally probable. 


si() 52(8) 


A A — 
0 t 0 t 
| T T 
-A -A 
Figure P-8.20 


8.21 The signal waveform 





e' O0<t<T 
s(t) = . 
0 otherwise 


is passed through its matched filter, A(t) = s(T — t). Determine the output of the 
matched filter. 


8.22 The data rate in a binary PAM communication system with AWGN is 2 Mbps. E 
the desired average-error probability is 1076, determine the SNR/bit, €, /No, and the 
power-to-noise ratio Pay/ No. 


8.23 A binary digital communication system employs the signals 


So(t)=0, O<t<T 
and 
sı (t) =A, O<t<T 


for transmitting the information. This is called on-off signaling. The demodulato 
cross correlates the received signal r (t) with s (t) and samples the output of th 
correlator at t = T. 


1. Determine the optimum detector for an AWGN channel and the optimum thresh 
old, assuming that the signals are equally probable. 

2. Determine the probability of error as a function of the SNR. How does on--ol 
signaling compare with antipodal signaling? 


8.24 Consider the signal 


£tcos2nfet, 0<t<T 
0, otherwise 


vo=| 
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1. Determine the impulse response of the matched filter for this signal. 
2. Determine the output of the matched filter at t = T. 


3. Suppose the signal u (t) is passed through a correlator that correlates the input 
u (t) with u (t) . Determine the value of the correlator output at t = T. Compare 
your result with that in Part (2). 


8.25 A carrier component is transmitted on the quadrature carrier in a communication 
system that transmits information via binary PSK. Hence, the received signal has the 
form 


v (t) = +/2P, cos (2n fet + P) + V2P; sin (2n fet + 6) +n(t), 


where ¢ is the carrier phase and n (t) is AWGN. The unmodulated carrier component 
is used as a pilot signal at the receiver to estimate the carrier phase. 


1. Sketch a block diagram of the receiver; including the carrier phase estimator. 


2. Mathematically illuswate the operations involved in the estimation of the carrier 
phase ¢. 


3. Express the probability of error for the detection of the binary PSK signal as 
a function of the total wansmitted power Pr = Ps + Pe. What is the loss in 
performance due to the allocation of a portion of the transmitted power to the 
pilot signal? Evaluate the loss for P;/Pr = 0.1. 


8.26 In the demodulation of a binary PSK signal received in white Gaussian noise, a 
phase-locked loop is used to estimate the carrier phase @. 


1. Determine the effect of a phase error ġ — $ on the probability of error. 
2. What is the loss in SNR if the phase error ¢ — FA = 45°? 


8.27 Consider the four-phase and eight-phase signal constellations shown in Figure 
P-8.27. Determine the radii rı and r2 of the circles so that the distance between 
two adjacent points in the two constellations is d. From this result, determine the 
additional wansmitted energy required in the 8-PSK signal to achieve the same error 
probability as the four-phase signal at high SNR, where the probability of error is 
determined by errors in selecting adjacent points. 


TA 
ay 


M=8 Figure P-8.27 
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$28 Consider the two 8-point QAM signal constellations shown in Figure P-8.28. The 

Lf minimum distance between adjacent points is 2A. Determine the average transmitted 

` power for each constellation assuming that the signal points are equally probable. 
Which constellation is more power efficient? 





(b) Figure P-8.28 


8.29 The 16-QAM signal constellation shown in Figure P-8.29 is an international standarc 
for telephone line modems (called V.29). Determine the optimum decision bound. 
aries for the detector; assume that the SNR is sufficiently high so that errors only 
occur between adjacent points. 


bi 





=5 
i Figure P-8.29 
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8.30 Specify a Gray code for the 16-QAM V.29 signal constellation shown in Problem 
8.29. 


8.31 Consider the octal signal point constellations in Figure P-8.31. 


1. The nearest neighbor signal points in the 8-QAM signal constellation are sepa- 
rated in distance by A units. Determine the radii a and b of the inner and outer 
circles. 


2. The adjacent signal points in the 8-PSK are separated by a distance of A units. 
Determine the radius r of the circle. 
3. Determine the average transmitter powers for the two signal constellations, and 


compare the two powers. What is the relative power advantage of one constella- 
tion over the other? (Assume that all signal points are equally probable.) 











8-PSK 8-QAM Figure P-8.31 


8.32 Consider the eight-point QAM signal constellation shown in Figure P-8.31. 


1. Assign three data bits to each point of the signal constellation so that the nearest 
(adjacent) points differ in only one bit position. 


2. Determine the symbol rate if the desired bit rate is 90 Mbps. 


3. Compare the SNR required for the eight-point QAM modulation with that re- 
quired for an eight-point PSK modulation having the same error probability. 


4. Which signal constellation (cight-point QAM or eight-point PSK) is more im- 
mune to phase errors? Explain the reason for your answer. 


8.33 Three messages, mı, m2, and m3, are to be transmitted over an AWGN channel with 
noise power spectral density mo. The messages are 


1 O<t<T 
sa | 0 otherwise 
and 
1 O<t<f 
ler<T. 


s2(t) = —s3(t) = 4 —1 < 
7 0 otherwise 
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1. What is the dimensionality of the signal space? 


2. Find an appropriate basis for the signal space. (Hint: You can find the basis 
without using the Gram-Schmidt procedure.) 


3. Sketch the signal constellation for this problem. 
4. Derive and sketch the optimal decision regions Rj, Ro, and R3. 


5. Which of the three messages is more vulnerable to errors and why? In othe 
words, which of the P(error|m; transmitted), i = 1, 2, 3 is larger? 

6. Using the union bound, find an upper bound on the error probability of thi: 
signaling scheme. 


8.34 A three-level PAM system is used to wansmit the output of a memoryless ternary 
source whose rate is 2000 symbols/sec. The signal constellation is shown in Figure 
P-8.34. Determine the input to the detector, the optimum threshold that minimize: 
the average probability of error, and the average probability of error. 


-A 0 A Figure P-8.34 


8.35 Consider a signal detector with an input 
r=+A +n, 


where +A and —A occur with equal probability and the noise variable n is charac 
terized by the (Laplacian) PDF shown in Figure P-8.35. 
1. Determine the probability of error as a function of the parameters A and ø. 


2. Determine the “SNR” required to achieve an error probability of 1075. How doe 
the SNR compare with the result for a Gaussian PDF? 





p(n) = 


l e-l VZ 
V20 





n FigureP-8.35 


8.36 Determine the average energy of a set of M PAM signals of the form 


Sm (t) =SmW (t), m=1,2,...,M 
O<t<T’ 
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where 
Sm = (2m — 1 — M)d, m=1,2,..., M. 


The signals are equally probable with amplitudes that are symmetric about zero and 
are uniformly spaced with distance 2d between adjacent amplitudes, as shown in 
Figure 8.41. 


8.37 In this chapter, we showed that an optimal demodulator can be realized as a 
correlation-type demodulator and a matched-filter-type demodulator, where in both 
cases, Yj (t) and 1 < j < N were used for correlating r (t) or designing the matched 
filters. Show that an optimal demodulator for a general M-ary communication sys- 
tem can also be designed based on correlating r(t) with s;(t) and 1 <i < Mor 
designing filters that are matched to s;(t)’s and 1 <i < M. Precisely describe the 
structure of such demodulators by giving their block diagram and all relevant design 
parameters, and compare their complexity with the complexity of the demodulators 
shown in Figures 8.35 and 8.36. 


8.38 A speech signal is sampled at a rate of 8 kHz, logarithmically compressed and 
encoded into a PCM format using 8 bits/sample. The PCM data is transmitted through 
an AWGN baseband channel via M-level PAM. Determine the symbol rate required 
for transmission when (a) M = 4, (b) M = 8, and (c) M = 16. 


8.39 Two equiprobable messages are transmitted via an additive white Gaussian noise 
channel with a noise power spectral density of a = 1. The messages are transmitted 
by the signals 

1 O<t<il 
t) = neers 
a) | 0 otherwise 


and s2(t) = sı (t — 1). 


We intended to implement the receiver using a correlation type structure, but due to 
imperfections in the design of the correlators, the structure shown in Figure 
P-8.39 has been implemented. The imperfection appears in the integrator in the upper 
branch where we have h 3 instead of h The decision box, therefore, observes 
rı and r2; based on this observation, it has to decide which message was trans- 
mitted. What decision rule should be adopted by the decision box for an optimal 
decision? 


He O ti 


r(t) Decision 
device 





ls 


Figure P-8.39 
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8.40 Consider a transmission line channel thatemploys n — 1 regenerative repeaters plus 
the terminal receiver in the transmission of binary information. We assume that 
the probability of error at the detector of each receiver is p and that errors among 
repeaters are statistically independent. 


1. Show that the binary error probability at the terminal receiver is 
1 
Py =5[-(-2p)"]. 


2. If p = 10% and n = 100, determine an approximate value of P,. 


8.41 A digital communication system consists of a transmission line with 100 digital 
(regenerative) repeaters. Binary antipodal signals are used for transmitting the infor- 
mation. If the overall end-to-end error probability is 1076, determine the probability 
of error for each repeater and the required ,/No to achieve this performance in 
AWGN. 


8.42 Derive the expression for the (one-sided) noise equivalent bandwidth of the PLL 
given by Equation (8.8.12). 


8.43 Suppose that the loop filter for a PLL has the transfer function 





1 
G(s) = ; 
s+/2 
1. Determine the closed-loop transfer function H (s) and indicate if the loop is 


stable. l 
2. Determine the damping factor and the natural frequency of the loop. 


8.44 Consider the PLL for estimating the carrier phase of a signal in which the loop filter 
is specified as 


G(s) = ———. 
(s) 1+ ts 


1. Determine the closed-loop transfer function H (s) and its gain at f = 0. 
2. For what range of values of tı and K is the loop stable? 


8.45 The loop filter G(s) in a PLL is implemented by the circuit shown in Figure P-8.45 
Determine the system function G(s) and express the time constants tı and 72, as 
shown in Equation (8.8.4), in terms of the circuit parameters. 


Ri 


R 
Input Output 


ores ee Figure P-8.45 
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8.46 The loop filter G(s) in a PLL is implemented with the active filter shown in Figure 


P-8.46. Determine the system function G(s) and express the time constants t; and 
T2, as shown in Equation (8.8.4), in terms of the circuit parameters. 


Rı C 
A NMN — 
y 


i___» > —o 
Figure P-8.46 


8.47 Show that the early—late gate synchronizer illustrated in Figure 8.80 is a close approx- 
imation to the timing recovery system illustrated in Figure P-8.47. 


9 | Sampler | 
re 
Matched a 
H filter | p + sgn(-) 
Loop 
MeO ~ | filter |“ Q 






Figure P-8.47 


(3s binary communication system transmits the same information on two diversity 
channels. The two received signals are 


n= +/6p, +n, 
ro = EV Eb + mM, 


where E(nı) = E(m) = 0, E(n?) = 07, E(n3) = o2, and n; and nz are uncor- 
related Gaussian variables. The detector bases its decision on the linear combination 
of rı and 72, i.€., 

r =r + krn. 


1. Determine the value of k that minimizes the probability of error. 


2. Plot the probability of error for o? = 1,07 = 3 and either k = 1 ork is the 
optimum value found in Part 1. Compare the results. 
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COMPUTER PROBLEMS 


8.1 Simulation of the Detector Performance for Binary Orthogonal Signals 








The purpose of this problem is to estimate and plot the probability of error as ; 
function of the SNR for a binary communication system that employs binary orthog 
onal signals that are transmitted over an additive white Gaussian noise channel. Thi 
| model of the binary communication system employing orthogonal signals is show! 
| in Figure CP-8.1. As shown, we simulate the generation of a binary sequence o 
| zeros and ones that occur with equal probability and are mutually statistically inde 
| pendent. To accomplish this task, we use a random number generator that generate 
uniform random numbers in the range (0, 1). If a number generated is in the rang 
(0, 0.5), the binary source output is a 0. Otherwise, it is a 1. If a zero is generated 
then yo = J/&, + no and yı = nı, where yo and yı represent the outputs of th 
two matched filters or the two correlators for the binary orthogonal signals. If a 1 i 
generated, then yo = no and yı = /%, + n; are generated. 






































The additive noise components ng and n; are generated by means of two Gaussia 
noise generators. Their means are zero and their variances are ø? = No/2. For con 
venience, we may normalize the signal energy €, to unity (€, = 1) and vary o* 
Note that the SNR, which is defined as ,/ No, is then equal to 1/ 207. The matche 
filter or correlator outputs yp and y; are fed to the detector, which decides whether a 
ora | has been received. The detector output is compared with the binary transmitte 
data sequence and an error counter is used to count the number of bit errors. 


Perform the simulation for the transmission of 10,000 bits at several different value 
of SNR, which cover the range of SNRs 0 < 10 log /No < 10dB. Plot the errc 
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Figure CP-8.1 Simulation model for binary orthogonal signal detection. 
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probability as a function of the SNR. Compare the estimated error probability with 
the theoretical error probability given by the formula 


8.2 Simulation of the Detector Performance for Binary Antipodal Signals 


The purpose of this problem is to estimate and plot the probability of error when 
binary antipodal signals are transmitted over an additive white Gaussian noise chan- 
nel. The model of the binary communication system employing antipodal signals is 
shown in Figure CP-8.2. As shown, we simulate the generation of the random vari- 
able y, which is the input to the detector. A uniform random number generator is 
used to generate the binary information sequence of zeros and ones from the data 
source. The sequence of zeros and ones is mapped into a sequence of +%,, where €, 
represents the signal energy per bit. A Gaussian noise generator is used to generate 
a sequence of zero-mean Gaussian numbers with variance o. For equally probable 
zeros and ones, the detector compares the random variable y with the threshold zero. 
If y > 0, the decision is made that the transmitted bit is a zero. If y < 0, the decision 
is made that the transmitted bit is a 1. The output of the detector is compared with 
the transmitted sequence of information bits, and the bit errors are counted. 


Perform the simulation for the transmission of 10,000 bits at several different values 
of SNR, which covers the range of SNRs 0 < 10l0g;ọ ,/No < 7. Plot the error 
probability as a function of the SNR. Compare the estimated error probability with 
the theoretical error probability given by the formula 
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Figure CP-8.2 Simulation model of the binary antipodal signal detection. 


8.3 Simulation of the Detector Performance for On-Off Signals 


The model for the system to be simulated is similar to that shown in Figure CP-8.2, 
except that one of the signals is zero. Thus, we generate a sequence of random 
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8.4 


8.5 


Digital Modulation in AWGN Channel Chapter 8 


variables {y;}, where either y; = n; when no signal is transmitted or y; = /@, + ni 
when the signal is wansmitted, and n; is a zero-mean Gaussian random variable with 
variance o? = No/2. Hence, the two possible PDFs or y; are shown in Figure CP-8.3, 
The detectot\compares each of the random variables {y;} to the optimum threshold 
J/&,/2.and makes the decision that a 0 was wansmitted when y; < /€,/2 and that 
a 1 was transmitted when y; > /€,/2. 


Perform the/simulation for the transmission of 10,000 bits at several different values 
of SNR that covers the range of SNRs 0 < log) €,/No < 13 dB. Plot the error prob- 
ability as a function of the SNR. Compare the estimated error probability obtained 
from the simulation with the theoretical error probability given by the formula 


Sp 
P =Q No 





r FigureCP-8.3 The probability density 
functions for the received signal at the output 
of the correlator for on-off signals. 


Effect of Noise in a Binary Communication System 


The effect of noise on the performance of a binary communication system can be 
observed from the received signal plus noise at the input to the detector. For example 
consider the binary communication system that employs binary orthogonal signals 
for which the input to the detector consists of a pair of random variables (yo, yı) 


where either 
Oo, y1) = (v Sp + No, nı) 


or 


Oo, y1) = (no. VE + nı) l 


The noise variables no and n; are zero-mean, independent Gaussian random variable: 


with equal variance o?. 


Perform the simulation as described in CP-8.1 to generate 100 samples of the receive: 
sequence (yo, yı) foreach value of o = 0.1,0 = 0.3, and o = 0.5, where €, i 
normalized to unity. Plot these 100 samples for each o as separate two-dimensiona 
plots with coordinates (yo, y;). Comment on the results. 


Simulation of the Detector Performance for 4-Amplitude PAM 


The purpose of this problem is to estimate and plot the probability of error as a func 
tion of the SNR for a digital communication system that employs four-level PAM 
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The receiver is assumed to consist of a signal correlator or a matched filter followed 
by a detector. The model for the system to be simulated is shown in Figure CP-8.5. 
As shown, we simulate the generation of the random variable y, which is the output 
of the signal correlator and the input to the detector. 


We begin by generating a sequence of quaternary symbols that are mapped into cor- 
responding amplitude levels {A,,}. To accomplish this task, we use a random number 
generator that generates a uniform random number in the range (0, 1). This range is 
subdivided into four equal intervals (0, 0.25), (0.25, 0.5), (0.5, 0.75), and (0.75, 1), 
where the subintervals correspond to the symbols (pairs of information bits) 00, 01, 
11, and 10, respectively. Thus, the output of the uniform random number genera- 
tor is mapped into the corresponding signal amplitude levels (—3d, —d, d, and 3d), 
respectively. 


The additive noise component having mean zero and variance o? is generated by 
means of a Gaussian random number generator (RNG). For convenience, we may 
normalize the distance parameter d = 1 and vary o?. The detector observes y = 
Am +n and computes the distance between y and the four possible transmitted signal 
amplitudes. Its output Am is the signal amplitude level corresponding to the smallest 
distance. Âm is compared with the actual transmitted signal amplitude, and an error 
counter is used to count the errors made by the detector. 


Perform a simulation for the transmission of 10,000 symbols at different values of 
the average bit SNR, which is defined as 


Cavb = 5/d z 
N 4\o?)’ 
and plot the estimated probability of a symbol error as a function of 10 log (avb / No). 


Also plot the theoretical expression for the probability of error and compare the 
simulated performance to the theoretical performance. 
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Figure CP-8.5 Simulation model of four-level PAM signal detection. 
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8.6 


8.7 


8.8 
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Simulation of 16-Amplitude PAM 


Modify the simulation described in CP-8.5 to wansmit 16-level PAM. In this case 
the 16-ary symbols may be generated directly by subdividing the interval (0, 1) int 
16 equal-width subintervals and mapping the 16-ary symbols into the 16-ary signa 
amplitudes. Perform the simulation and plot the estimated error probability. Also 
plot the theoretical probability of a symbol error given by the expression in the tex 
and compare the estimated values to the theoretical values. 


Generation of PSK Signal Waveforms 


The objective of this problem is to generate constant envelope PSK signal waveform: 
described mathematically by the expression 


/26, 2mm =0,1,2,...,M—-—1 
Um(t) = Te eos (an fir + E), eT ; 


For convenience, the signal amplitude may be normalized to unity. 


Generate and plot the PSK signal waveforms for the case in which fe = 6/T an 
M = 8 over the time interval 0 < t < T. 


Demodulation of PSK Signals 


In this problem, we consider the demodulation of a M = 4 PSK signal waveforn 
r(t) as given by 


r(t) = um(t) + n(t) 
= Um(t) +n-(t) cos 27 fet — ns(t) sin 27 fet, 


where the transmitted signal is 


Um(t) = 8r (t) cos (=) cos 27 fet — gr(t) sin (= ) sin 2x fet 
= SmeWi(t) + Sms W(t) 


and n(t) is the additive Gaussian noise process. The pulse shape gr (t) is rectangula: 
i.e., 





2%. 
o= 7 O<t<T 


otherwise 


The demodulator employs two quadrature correlators to compute the two inputs t 
the detector at the sampling time T: 


Y(t) -f r(t)yi(t) dr, 


ys(t) = l r(t)W2(t) dt, 


8.9 





SEE OEE LT eT ee 
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where yı (t) and y(t) are two orthogonal basis waveforms given by 


l 
p(t) = y z gr (t) cos 27 fet, 
1 f 
plt) = —, | z gr (t) sin 2x fet. 


Let us implement the two correlators in discrete time. Thus, the two outputs of the 
correlators are 
n 
ye(nTs) = X rE) n=1,2,..., 
k=0 
n 
Ys (NTs) = X r(kTs)Wo(kTs), n=1,2..., 
k=0 


where the sampling interval is Ts = T/100 and the carrier frequency fe = 30/ T. The 
noise samples ne(kTs) and ns(kT;) are statistically independent, zero-mean Gaussian 
with variance o?. Perform the computation and plot ye(nTs) and ys(n7;) for n = 
1,2,..., 100, o? = 0, o? = 0.05, o? = 0.5 and each of the phases in a four-phase 
PSK signal. 


Simulation of M = 4 PSK Modulation and Detection 


The objective of this problem is to estimate the probability of error for a communi- 
cation system that employs M = 4 PSK modulation. The model for the system to be 
simulated is shown in Figure CP-8.8. 


As shown, we simulate the generation of the random vector y = Sm + n, which is 
the output of the signal correlator and the input to the detector. We begin by generat- 
ing two-bit symbols that are mapped into the corresponding four-phase signal points. 
To accomplish this task, we use arandom number generator that generates a uniform 
random number in the range (0, 1). This range is subdivided into four equal intenvals, 
(0, 0.25), (0.25, 0.50), (0.50, 0.75), and (0.75, 1.0), where the subintervals c 
spond to the pairs of information bits 00, 01, 11, and 10, respectively. These pairs ‘ef 
bits are used to select the signal phase vectors Sm, m = 1, 2, 3, 4. 





The additive noise components nc and ns are statistically independent zero-mean 
Gaussian random variables with an equal variance o?. For convenience, we may 
normalize the signal energy @, to unity and control the SNR by scaling the noise 


variance o°. 


The detector observes the received signal vector y in each symbol interval and com- 
putes the projection (dot product) of y into the four possible signal vectors Sm. Its 
decision is based on selecting the signal point corresponding to the largest projection. 
The output decisions from the detector are compared with the transmitted symbols, 
and symbol errors and bit errors are counted. 
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Figure CP-8.9 Simulation model of M = 4 PSK signal detection. 


Perform the simulation of the four-phase system just described for 10,000 symbol 
(20,000 bits) for the range of SNR 0 < 101og;,@,/No < 8 dB, and plot the esti 
mated probability of a bit error and probability of a symbol error. Also, plot the thec 
retical bit error probability for M = 4 PSK and compare the simulated performanc 
to the theoretical error probability. Comment on the results. 


Simulation of M = 4 Differential PSK Modulation and Detection 


The objective of this problem is to estimate the probability of error for a communi 
cation system that employs M = 4 differential PSK (DPSK) modulation. The mode 
for the system to be simulated is shown in Figure CP-8.9. 


The M = 4DPSK mapper changes the phase of the transmitted signal relative t 
the phase of the previous transmission as follows: (a) by 0 radians if the bits to b 
transmitted are 00, (b) by 1/2 radians if the bits to be transmitted are 01, (c) by : 
radians if the bits to be transmitted are 11, and (d) by 3/2 radians if the bits t 
be transmitted are 10. Two Gaussian random noise generators (RNG) are used t 
generate the noise components (nc, ns), which are uncorrelated, are zero mean, an 
have the variance a”. Hence, the received signal-plus-noise vector is 


nm . nm 
y= [cos 5 +n: sin > + ns] 


= [ye ys] . 


The differential detector basically computes the phase difference between two suc 
cessive received signal vectors y, and y,_,. Mathematically, this computation ca 
be performed as 
VVe—- = Nek + FY sk) Yek-1 — FY sk—1) 
=y tI Yir 
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and & = tan™!(y;/y,) is the phase difference. The value of 6; is compared with 
the possible phase differences (0°, 90°, 180°, and 270°), and a decision is made in 
favor of the phase that is closest to @. The detected phase is then mapped into the 
pair of information bits. The error counter counts the symbol errors in the detected 
sequence. 


Perform the simulation of the M = 4 DPSK system as previously described for 
10,000 symbols (20,000 bits) for the range of SNR 0 < 10log,)€@,/No < 10 dB, 
and plot the estimated probability of a symbol error. Also, plot the theoretical symbol 
error probability for M = 4 PSK and compare the simulated performance for M = 4 
DPSK to the theoretical error probability for M = 4 phase coherent PSK. Comment 
on the results. 
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Figure CP-8.10 Simulation model of M = 4 DPSK signal detection. 


8.11 Demodulation of QAM Signals 


The demodulator for a QAM signal employs two quadrature correlators that cross 
correlate the received signal given by 


r(t) = Ame8r(t) cos(2x fet +$) + Ams8r(t) sin(2x fet + $) + n(t) 


with the phase quadrature basis functions given by 


1 
Wit) = [een cos(2n fot + ġ), 


Y(t) = -f 8T (t) sin(2x fet + $). 
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8.12 
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The pulse shape gr (t) is rectangular; i.e., 


| Ês 
=| T Oster. 


0 otherwise 





Let us implement the two correlators in discrete time. Thus, the two outputs of tt 
correlators are 


ye(nTs) = Yorks) WikTs), n=1,2,..., 
k=0 

Y(T) = X rkTs)¥o(kT;), n=1,2,..., 
k=0 


where the sampling interval is Ts = T /100 andthe carrier frequency fe = 30/T. Tt 
carrier phase @ may be selected to be uniformly distributed in the interval (0, 2x 
and the additive noise samples n-(kT;) and ns(kT;) are statistically independer 
zero-mean Gaussian with variance o?. Perform the computation and plot y,(n7 
and y,(nT;) for n = 1,2,..., 100, o? = 0, o? = 0.05, o? = 0.5 forthe M = 
QAM signal constellation shown in Figure 8.56(b). We may select any one of tł 
eight signal points for transmission. 


Simulation of M = 16 QAM Modulation and Detection 


The objective of this problem is to estimate the probability of error for a commun 
cation system that employs M = 16 quadrature-amplitude modulation (QAM). T} 
model of the system to be simulated is shown in Figure CP-8.12(a). The unifor 
random number generator (RNG) is used to generate the sequence of informatic 
symbols corresponding to the 16 possible 4-bit combinations of bj, b2, b3, and b 
The information symbols are mapped into the corresponding signal points, as shov 
in Figure CP-8.12(b), which have the coordinates [Amc, Ams]. Two Gaussian RNC 
are used to generate the uncorrelated, zero-mean noise components [ne, ns], each: 
which has a variance o*. Consequently, the received signal-plus-noise vector at tl 
input to the detector is 


Y = [Amc +n: Ams +ns]. 


The detector computes the Euclidean distance between y and each of the 16 possib 
transmitted signal vectors; then it selects the signal pointthat is closest to the receiv: 
vector y. The error counter counts the symbol errors in the detected sequence. 


Perform the simulation of the M = 16 QAM system as described for 10,000 syr 
bols (40,000 bits) forthe range SNR 0 < 10log,9 (Gbav/No) < 13 dB, and plot ti 
estimated probability of a symbol error. Also, plot the theoretical symbol error pro 
ability for M = 16 QAM and compare the simulated performance to the theoretic 
error probability. Comment on the results. 
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Figure CP-8.12 Simulation model for M = 16 QAM signal detection. 


8.13 Repeat Example 8.3.3 of the correlation-type demodulator when the received signal 
is corrupted by AWGN, where the noise samples have variances o? =0.1, 0.5, and 
1.0. 


8.14 Repeat Example 8.3.5 of the matched-filter demodulator when the received signal is 
corrupted by AWGN, where the noise samples have variances o? = 0.1, 0.5, and 
1.0. 


8.15 Simulation of Early—-Late Gate Synchronizer for PAM 


The objective of this problem is to simulate the operation of an early—late gate syn- 
chronizer for a binary PAM system. The basic pulse used in PAM has a raised cosine 
spectrum with a roll-off factor of 0.4. The system transmission rate 1/T = 4800 
bits/sec. Consequently, the pulse is given as 


; cos 1920xt 
D = sine(4800) — LSL Amt 
xA) = emea B00 T 746x 10 


This signal pulse extends from —oo to +00. 
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Plot x(t) and verify that, for all practical purposes, it is sufficient to consider only 
the interval |t| < 0.6 x 1073, which is roughly [—37, 3T]. Truncate the pulse tc 
this interval and compute the autocorrelation function. Plot the autocorrelation anc 
determine its length in samples and the position of its maximum, i.e., the optimun 
sampling time. Select the sampling rate to be 40 samples per bit interval T. 


Simulate the early—late gate synchronizer when the incorrect sampling is to the righ 
or to the left of the peak by 10 samples and verify that the synchronizer finds th 
correct sampling time, i.e., the maximum of the pulse. 
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In this chapter, we expand the treatment of digital modulation waveforms to multidi- 
mensional signals. We develop the geometric representation of M-ary orthogonal, and 
biorthogonal signals, simplex signals, and binary-coded signals. Then, we evaluate the 
average probability of error when these signals are transmitted through an additive white 
Gaussian noise (AWGN) channel. We also describe M-ary frequency-shift keying (FSK) 
signals, their modulation and their detection, including noncoherent detection and the prob- 
ability of error for noncoherent detection. The reader is also introduced to continuous- 
phase FSK (CPFSK) signals and continuous-phase modulation (CPM). A comparison of 
the various modulation methods presented in this chapter and Chapter 8 is described on 
the basis of their power requirements, their bandwidth requirements and their probability 
of error performance. 


= 9.1 M-ARY ORTHOGONAL SIGNALS 


M-ary orthogonal signal waveforms at baseband can be constructed in a variety of ways. 
For example, Figure 9.1 illustrates two sets of M = 4 orthogonal signal waveforms. We 
observe that the signal waveforms s;(t) andi = 1,2,3,4 in Figure 9.1(a) do not over- 
lap over the time interval (0, T), and they represent pulse position modulation (PPM) for 
M = 4. The signal waveforms s{(t) and i = 1, 2, 3, 4 in Figure 9.1(b) are overlapping in 
time, but still satisfy the orthogonality condition, namely, 


T 
f s(t)s;(t)dt =0, i # j. (9.1.1) 
0 


The number of dimensions required to represent a set of M orthogonal waveforms is 
N = M. Hence, asetof M orthogonal signal waveforms can be represented geometrically 
by M orthogonal vectors in M-dimensional space. To be specific, let us consider M-ary 
PPM signal waveforms. Such a set of M baseband PPM signals are expressed mathemati- 
cally as 


sm(t) = /€sVn (t), m= 1,2,.,M, (9.1.2) 
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Figure 9.1 Two sets of M = 4 orthogonal signal waveforms. 





Section 9.1 M-ary Orthogonal Signals 2 487 


& r(t) Yn @ 





VMIT 


Figure 9.2 Rectangular pulse 
8r(t) and basis function ¥,, (t) 

. for M-ary PPM signals 
waveform. 





where YWm(t) and m = 1, 2, .., M are a set of M orthogonal basis waveforms. These wave- 
form are defined as 


— 1.3 
ry; <t< (9.1.3) 


(m —1)T (m —1)T mT 
Wm(t) = 8r (- =) -Hn M’ 


in which g7(t) is a unit energy pulse, which is nonzero over the time interval 0 < t < T/M 
and the basis functions WYm(t) and m = 1,2,..., M are simply time-shifted replicas of 
gr (t), as illustrated in Figure 9.2. Each signal waveform Sm (t) has energy 


f szadi =8 R (t)dt = $,, all 
UG Per. Se ole enn en ee 


Es denotes the energy of each of the signal waveforms representing k-bit symbols. Con- 
sequently, M-ary PPM signal waveforms are represented geometrically by the following 
M -dimensional vectors: 
sı =(V%;,0,0,...,0); 
Qo = (0, Vs, 0,...,0); 


sm = (0,0,0, ..., V8). 


Clearly, these vectors are orthogonal, i.e., s;-s; = 0 when i # j. It is also interesting to 
note that the M signal vectors are mutually equidistant, i.e., 


dmn = VI 8m — Sn |? = V28s, for allm £n. (9.1.5) 


(9.1.4) 


Example 9.1.1 


Determine the vectors in a geometric representation of the M = 4 signal waveforms s/(t) and 
i = 1, 2, 3, 4, that are shown in Figure 9.1(b). Use the basis waveforms Ym (t) that are shown 
in Figure 9.2 


Solution We note that the four orthogonal waveforms have equal energy, given by 


if i aS [ane 
Jo [s] T Jo j 


By computing the projection of each signal waveform on the four basis waveforms Yn (t), i.e., 


T 
j Si(t)Ym(t)dt, m = 1, 2, 3, 4, 
0 
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we obtain the vector s;. Thus, we obtain 


1 = (VEJE, VEJA, VBA, VE); 

s, = (VEA, VEA, -VB/4, - VE); 
s) = (VEA, VEA, VEs14, V84); 
s, = (V64, -VE8./4, —VEs14 , VEA). 


We observe that these four signal vectors are orthogonal, i.e., s;:s} = Ofori # j. l 


a 
= 
| 








The baseband M -ary orthogonal signals described above can be transmitted on band 
pass channels by simply multiplying the baseband signals by the appropriate carrie 
cos 21 fet. Consequently, the transmitted signal is a double-sideband suppressed-carric 
(DSB-SC) amplitude-modulated signal. As in our discussion of carrier-modulated signal 
in Chapter 8, the received signal carrier is generally shifted in phase, necessitating th 
need for a phase-locked loop (PLL) to estimate the carrier phase in order to perform sign: 
demodulation. Once the received signal is converted to baseband by the demodulator, th 
detector operates as in the case of baseband signal transmission in computing the distanc 
metrics and making decisions on the wansmitted symbols. 


9.1.1 Probability of Error for M-ary Orthogonal Signals 


In deriving the probability of error for the general class of M-ary orthogonal signals, 
is convenient to use PPM signals, which have the simple vector representation given i 
Equation (9.1.4), where @, is the energy for each of the signal waveforms. 

For equal energy orthogonal signals, the optimum detector selects the signal resultin 
in the largest cross correlation between the received vector y and each of the M possib) 
transmitted signal vectors {Sm}, i.e., 


M 
CY, Sm) = Y-Sm =Y ksm  m=1,2,..., M. (9.1. 
k=1 


To evaluate the probability of error, let us suppose that the signal sı is transmitte: 


Then the vector at the output of the M cross correlators or M matched filters and, thus, tt 
input to the detector is the vector 


y = (VE +m, nn,- nm), (9.1. 


where 71, n2, ..., nm are zero-mean, mutually statistically independent Gaussian rando: 
variables with equal variance o? = No/2. Substituting Equation (9.1.7) into Equatic 
(9.1.6), we obtain 

C(y,s1) =s (VE; +n); 


C(y, 82) = s&n; 
9.1: 


C(y, su) = VEsnm. 
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Note that the scale factor vé, may be eliminated from the correlator outputs by dividing 


each output by vés. Then, with this normalization, the probability density function (PDF) 
of the first correlator output (y1 = Vs + nı) is 


a (rvB) m 





1 
= — 9.1.9 
fon RNG (9.1.9) 
and the PDFs of the other M — 1 correlator outputs are 
1 z 2 fN 
= em 0, m=2,3,...,M. (9.1.10) 
JOm) JaNo 


It is mathematically convenient to first derive the probability that the detector makes 
a correct decision. This is the probability that yı is larger than each of the other M — 1 


correlator outputs 2, n3, ..., nm. This probability may be expressed as 
wo 
Po= f| Pena < yina < yian < nb fy ODAV 0.1.11) 
—00 
where P(n2 < y1, n3 < Yy1,... nm < Yılyı) denotes the joint probability that n2, n3, .., 


ny are all less than yı, conditioned on any given y1. Then this joint probability is averaged 
over all yı. Since the {ym} are statistically independent, the joint probability factors into a 
product of M — 1 marginal probabilities of the form 


fl 
Pom < yin) = | POO ia =D 
—00 


1 pv2v7/No 


ZAR EAS —yz /2 
oy i a (9.1.12) 
2y? 
= 7 Z2 
Q No 
These probabilities are identical for m = 2,3,..., M; hence, the joint probability under 


consideration is simply the result in Equation (9.1.12) raised to the (M — 1) power. Thus, 
the probability of a correct decision is 


M-1 
Pe = Ps Oa fOr) dyi (9.1.13) 
a: No 
and the probability of a k-bit symbol error is 


Py =1-— P.. (9.1.14) 
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Therefore, 
1 f” Mead j 
Ma T L (1-1 — gayi- e bvm) 12 ay. (9.1.15) 


The same expression for the probability of error is obtained when any one of the other 
M — 1 signals is transmitted. Since all the M signals are equally likely, the expression for 
Pm given in Equation (9.1.15) is the average probability of a symbol error. This expression 
can be evaluated numerically. 

To make a fair comparison of communication systems, we want to have the proba- 
bility of error expressed in terms of the signal-to-noise ratio (SNR)/bit, 6,/No, instead of 
the SNR/symbol, @;/No. This is important because, depending on the size of constellation, 
different systems carry a different number of bits/signal. With M = 2", each symbol con- 
veys k bits of information; hence, s = kp. Thus, Equation (9.1.15) may be expressed in 
terms of ,/No by substituting for €s. 

Sometimes, we also want to convert the probability of a symbol error into an equiv- 
alent probability of a binary digit error. For equiprobable orthogonal signals, all symbol 
errors are equiprobable and occur with probability 


Pu Pm 


Furthermore, there are (*) ways in which n bits out of k may be in error. Hence, the 
average number of bit errors per k-bit symbol is 


k k-1 
: k\ P 2 


n=} 


and the average bit-error probability is simply the result in Equation (9.1.17) divided by k, 
the number of bits/symbol. Thus, 


pya H 9.1.18) 
Pe = Set k> 1. (9.1.18 
The graphs of the probability of a binary digit error as a function of the SNR/bit 
€»p/No, are shown in Figure 9.3 for M = 2, 4, 8, 16, 32, 64. This figure illustrates that by 
increasing the number M of waveforms, we can reduce the SNR/bit required to achieve < 
given probability of a bit error. For example, to achieve a P, = 1075, the required SNR/bi 
is a little more than 12 dB for M = 2, but if M is increased to 64 signal waveform: 
(k = 6 bits/symbol), the required SNR/bit is approximately 6 dB. Thus, a savings of ove: 
6 dB (a factor of four reduction) is realized in the transmitter power (or energy) requirec 
to achieve a P, = 107 by increasing M from M = 2 to M = 64. Note that this behavio: 
is in complete contrast with M-ary pulse amplitude modulation (PAM), where increasins 
M increases the error probability. 
What is the minimum required €,/No to achieve an arbitrarily small probability 
of error as M —> oo? To answer this question, we apply the union bound, derived i! 
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Figure 9.3 Probability of a bit 
error for optimum detection of 
orthogonal signals. 
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Section 8.4.2 for a general equiprobable M-ary signaling scheme, to the special case of an 
M.-ary orthogonal signaling system as described in the following section. 





9.1.2 A Union Bound on the Error Probability of M-ary Orthogonal Signals 


Let us investigate the effects of increasing M on the probability of error for orthogonal sig- 
nals. To simplify the mathematical development, we use the union bound (see Section 8.4.2) 
to derive an upper bound on the probability of a symbol error. This is much easier than the 
exact form given in Equation (9.1.15). 

As we discussed earlier [see Equation (9.1.5)], M-ary orthogonal signals are equidis- 
tant with 


L 
E 
3 


dÈ mi = Sm — Sm l? = 28s; (9.1.19) 


m T 
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therefore, : 
dmin = V 26s. (9.1.20) 


Using this value of dmin in Equation (8.4.42), we obtain the union bound on the error prob- 
ability of an M-ary orthogonal signaling system as 


M-1 _%s E 


Pu < o < Me 20. (9.1.21 


Thus, using M = 2‘ and €, = k6p, we have 
Py <2 eK E,/2No 


= o-k(Ep/No-21n2)/2_ (9.1.22 


As k — œ or equivalently, as M — oo, the probability of error approaches zero exponen. 
tially, provided that @,/Npo is greater than 2 In 2, i.e., 


sl) > 2In2 = 1.39 © 1.42 dB. (9.1.23 
No 
The simple upper bound on the probability of error given by Equation (9.1.22) implie; 
that as long as SNR > 1.42 dB, we can achieve an arbitrarily low Py. However, thi, 
union bound is not a very tight upper bound at low SNR values. In fact, by more elaborat: 
bounding techniques, it can be shown that Py — 0 as k — oo, provided that 


Sy > In2 = 0.693 ~ —1.6dB. (9.1.24 
No 
Hence, —1.6 dB is the minimum required SNR/bit to achieve an arbitrarily small proba 
bility of error in the limit as k — œo (M — oo). This minimum SNR/bit (—1.6 dB) i 
called the Shannon limit for an additive white Gaussian noise channel. More discussio: 
on the Shannon limit and how the value of —1.6 dB is derived is given in Section 12.6 i 
Equation (12.6.7) and the discussion leading to it. 


9.2 BIORTHOGONAL SIGNALS 


In general, a set of M biorthogonal signals are constructed from a set of M/2 orthogc 
nal signals [s;(t), i = 1,2, ..., M/2] and their negatives [—s;(t),i = 1,2,..., M/Z 
As we shall observe later in this chapter, the channel bandwidth required to transmit th 
information sequence via biorthogonal signals is just one-half of that required to transm 
M orthogonal signals. For this reason, biorthogonal signals are preferred over orthogon: 
signals in some applications. 

To develop the geometric representation of M-ary biorthogonal signals, let us cor 
sider PPM signals. We begin with M/2 PPM signals of the form 


S(t) = VEs Wm(t), m=1,2,...,M/2, 0<t<T, (9.2.1 





a 
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Figure 9.4 Rectangular pulse g;(t) and basis function W(t) for M-ary biorthogonal PPM signal waveforms. 


where Wn(t),m = 1,2,...., M/2, are a set of M/2 basis waveforms as illustrated in 
Figure 9.4 and defined mathematically as 
(m —1)T (m —1)T mT 
t= ARE E a —, 9.2.2 
Vim (t) er( Mp? M2 ~' up (9.2.2) 


where gr (t) is also shown in Figure 9.4. The negatives of these signal waveforms are 
SK 4m(t) = -Sm (t) = —VEsYm(t), m= 1,2, ...M/2. (9.2.3) 


We observe that the number of dimensions needed to represent the M/2 orthogonal 
signals is N = M/2. In the case of PPM signals, the signal vectors of M/2 dimensions are 


S1 = (V€;,0,0,...,0); 
s2 = (0, V%;,0,...,0); (9.2.4) 


sm2 = (0,0,0,..., VEs). 


The other M/2 signal vectors corresponding to the signal waveforms that are the negatives 
of the M/2 PPM signal waveforms are 


sm, = (-v@;, 0,0,...,0); 


2 


$42 =(0, -VB;, 0, ...,0); 


(9.2.5) 
su = (0,0,0,...,-V;). 
Example 9.2.1 
Determine the vector representation of the four biorthogonal signal waveforms shown in 
Figure 9.5(a). 


Solution The signal waveform s(t) and s2(t) are orthogonal and are represented by the 
vectors 


S; = (v8, 0) 
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and 


s2 = (0, vé). 


The signal waveforms s3(t) = —s,(t) and s4(t) = —s2(t). Therefore, the geometric represen. 
tation of s3(t) and s4(t) is 


j s3 = (-v%., 0) 




































































and 
S4 = (0, -v%,). a 
The signal points corresponding to these vectors are shown in Figure 9.5(b). It is interesting tc 
note that the signal points shown in Figure 9.5 for the M = 4 biorthogonal signals are identical 
to the signals points in a four-phase phase-shift keying (PSK) signal. I 
s1(4) 52(0) 
1 
V26,/T 
t T 
0 T/2 
53(1) Sa(t) 
T/2 T/2 T 
>t =t 
0 
i -V2%,IT -VZIT + 
(a) 
(p S2 
nj 
Hid 
i 
l P Sq 
| 
| ©) 
Hi 
Figure 9.5 M = 4 biorthogonal signal waveforms and signal point constellation for Example 9.2.1. 
ihi; 
| i 
| 
i 
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In a general set of M biorthogonal signals, the distance between any pair of signal 
vectors is eitherdmn = ./26, when the two signal vectors are orthogonal or dmn = 2V8; 
when one of the two signal vectors is the negative of the other (antipodal). 

As in the case of M-ary orthogonal signals, the baseband biorthogonal signals can 
be transmitted on bandpass channels by simply multiplying the baseband signals by the 
appropriate carrier cos 2 ft to produce a DSB-SC amplitude-modulated signal. 


9.2.1 Probability of Error for M-ary Biorthogonal Signals 


As previously indicated in Section 9.2, a set of M = 2* biorthogonal signals are con- 
structed from M/2 orthogonal signals by including the negatives of the orthogonal signals. 
Thus, we achieve a reduction in the complexity of the demodulator for the biorthogo- 
nal signals relative to that for the orthogonal signals. This occurs because the former is 
implemented with M /2 cross correlators or matched filters, whereas the latter requires M 
matched filters or cross correlators. Biorthogonal signals are also more bandwidth efficient 
than orthogonal signals. 

The vector representation of biorthogonal signals is given by Equations (9.2.4) and 
(9.2.5). To evaluate the probability of error for the optimum detector, let us assume that the 
signal sı (t) corresponding to the vector sı = (/;, 0, 0, ...,0) was transmitted. Then, 
the received signal vector is 


y= (VE + N1,N2,..., nmn) i (9.2.6) 


where the {nm} are zero-mean, mutually statistically independent, and identically dis- 
tributed Gaussian random variables with the variance o? = No/2. The optimum detec- 
tor decides in favor of the signal corresponding to the largest in magnitude of the cross 
correlators, or 


M/2 
C(I, Sm) =¥Sm =F YeSmks m= 1,2,...,M/2, (9.2.7) 
k=1 


while the sign of this largest term is used to decide whether sm(t) or —sm(t) was trans- 
mitted. According to this decision rule, the probability of a correct decision is equal to 
the probability that y} = VEs +n, > Oand y; exceeds |ym| = |am| for m = 2,3,..., 
M /2. But 





1 yı 
P (nml < yı \y1 > 0) = eo *"/No dx 
ANo J-y, 


1 yi// No/2 


= —x?/2 
= — e™ 2 dx, (9.2.8) 
V 2T J —y1/4/No/2 


so the probability of a correct decision is 


M_ 
oo 1 Y1/4/ No/2 2 2 a F 
pe i ae l e*Pdx| fondy. 
e Jo | 25 d-n ymi 
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Upon substitution for f (yı), we obtain 


2% s/No 
as) Ledu, 0.29) 


R= f mhea Jim 


where we have used the PDF of y, given in Equation (9.1.9). Finally, the probability of a 
symbol error Py = 1 — P}. 

P. and, hence, Py may be evaluated numerically for different values of M from 
Equation (9.2.9). The graph shown in Figure 9.6 illustrates Py as function of @,/No. 
where s = k@, for M = 2,4, 8, 16, and 32. We observe that this graph is similar tc 
that for orthogonal signals (see Figure 9.3). However, in this case, the probability of erroi 
for M = 4 is greater than that for M = 2. This is because we have plotted the symbo! 
error probability Py in Figure 9.6. If we plot the equivalent bit error probability, we woulc 
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find that the graphs for M = 2 and M = 4 coincide. As in the case of orthogonal signals, 
as M — oo (ork — oo), the minimum required €,/No to achieve an arbitrarily small 
probability of error is —1.6 dB, the Shannon limit. 





9.3 SIMPLEX SIGNALS 


Another set of M signal waveforms that can be constructed from M orthogonal signals is 
obtained by subtracting the average of the M orthogonal signals from each of the orthogo- 
nal signal waveforms. The M signal waveforms that result are called simplex signal wave- 
forms. Thus, if we have M orthogonal baseband signal waveforms {sm(t)}, the simplex 
signal waveforms, denoted as {s/,(t)}, are constructed as 


M 
; 1 
S(t) = S(t) — 5 Yo s(t). (9.3.1) 
k=1 
Then, it follows that (see Problem 9.3) the energy of these signals sj, (£) is 
T 2 1 
g€ = f [Ol dt = (1 — z) E (9.3.2) 
0 M 
and 
F E 
' (Ds, (t) dt = -——— ; .3. 
| AOO =- men (933) 


where ©, is the energy of each of the orthogonal signals and €/ is the energy of each of the 
signals in the simplex signal set. Note that the waveforms in the simplex set have smaller 
energy than the waveforms in the orthogonal signal set. Second, we note that simplex signal 
waveforms are not orthogonal. Instead, they have a negative correlation, which is equal 
for all pairs of signal waveforms. We surmise that among all the possible M-ary signal 
waveforms of equal energy @,, the simplex signal set results in the smallest probability of 
error when used to transmit information on an additive white Gaussian noise channel. 

The geometric representation of a set of M simplex signals is obtained by subtracting 
the mean signal vector from a set of M orthogonal vectors. Thus, we have 


M 
1 
S, = Sm — ij Shs m=1,2,...,M. (9.3.4) 
k=l 
The effect of subtracting the mean signal 


iv 
Ss = — .3.5 
Ky T 2 Sk (9.3.5) 


from each orthogonal vector is to translate the origin of the M orthogonal signals to the 
point 5 and to minimize the energy in the signal set {s’,}. 
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If the energy per signal for the orthogonal signals is $; =|| Sm ||’, then the energy 
for the simplex signals is 
E =I si, I5 Sm — 5 I? 
~ (9.3.6) 
=(1- 7) 8 


The distance between any two signal points is not changed by the translation of the ori- 
gin, i.e., the distance between signal points remains at d = VIZ. Finally, as indicated, 
the M simplex signals are correlated. The cross-correlation coefficient (normalized cross 
correlation) between the mth and nth signals is 


s' es! 
Is), Wil st, |l 
—1/M -1 


(l-—1/M) M-—-1' 


Ym = 
(9.3.7) 


Hence, all the signals have the same pairwise correlation. Figure 9.7 illustrates a set 
of M = 4 simplex signals. a 

As in the case.of M-ary orthogonal signals, the baseband simplex signals can be 
transmitted on bandpass channels by simply multiplying the baseband signals by the appro- 
priate carrier cos 27 fct to produce a DSB-SC amplitude-modulated signal. 


9.3.1 Probability of Error for M-ary Simplex Signals 


Next, we consider the probability of error for M simplex signals. Recall from Section 9.3 
that simplex signals area set of M equally correlated signals with a mutual cross-correlatior 
coefficient Ymn = —1/(M — 1). These signals have the same minimum separation oj 
/2€, between adjacent signal points in M-dimensional space as orthogonal signals. They 
achieve this mutual separation with a transmitted energy of 6,(M — 1)/M, which is less 
than that required for orthogonal signals by a factor of (M — 1)/M. Consequently, the 
probability of error for simplex signals is identical to the probability of error for orthogona: 
signals, but this performance is achieved with a savings of 


S2 


S3 $1 


S4 Figure 9.7 Signal constellation for the M = 4 simplex signals. 
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10 log(1 — Ymn) = 10 log dB (9.3.8) 


M-1 
in SNR. Notice that for M = 2, i.e., binary modulation, the simplex signals become antipo- 
dal; consequently, the transmitted signal energy (or power) required to achieve the same 
performance as binary orthogonal signals is 3 dB less. Therefore, the result in Equation 
(9.3.8) is consistent with our previous comparison of binary orthogonal signals and antipo- 
dal signals. For large values of M, the difference in performance between orthogonal sig- 
nals and simplex signals approaches zero. 


; 94 BINARY-CODED SIGNALS 


Signal waveforms for transmitting digital information may also be constructed from a set 
of M binary code words of the form 


Cm = (Cm1, Cm2 ---,CmN) m=1,2,...,M, (9.4.1) 


where Cmj = Oor 1 forall m and j. In this form, N is called the block length, or dimension, 
of the code words. Given M code words, we can construct M signal waveforms by mapping 
a code bit Cmj = 1 into a pulse gy (t) of duration T/N and a code bit cm; = O into the 
negative pulse —g7 (t). 


Example 9.4.1 
Given the code words 


& =[11110] 


&=[11001] 
cg =[10101) 
c4 =[0101 0] 


construct a set of M = 4 signal waveforms, as previously described, using a rectangular pulse 
&r(t). 


Solution As indicated, a code bit 1 is mapped into the rectangular pulse g7(t) of duration 
T/5, and a code bit 0 is mapped into the rectangular pulse —g7(t). Thus, we construct the 
four waveforms shown in Figure 9.8 that correspond to the four code words. a 


Let us consider the geometric representation of a set of M signal waveforms gener- 
ated from a set of M binary words of the form 


Gn = (Cm1,Cm2s +++» CmN)> n= 1, 2,...,M, (9.4.2) 


where Cmj = 0 or 1 for all m and j. The M signal waveforms are of dimension N and are 
represented geometrically in vector form as 


Sin = (Sm1, Sm2s ... »Smn)> n= l, 2, ony M, (9.4.3) 


where Sj = +,/%,;/N for all m and j. Thus, the energy of each signal waveform is @,. 
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isı (t) s2 (t) 


TIS 27/5 3T/5 








-V%jF 4- — -vV5&IT |— B E 


Figure 9.8 A set of M = 4 signal waveforms of dimension N = 5 constructed from the four code words in 
Example 9.4.1. 


In general, there are 2™ possible signals that can be constructed from the 2” pos: 
sible binary code words. The M code words are a subset of the 2" possible binary cod 
words. We also observe that the 2” possible signal points correspond to the vertices of a 
N-dimensional hypercube with its center at the origin. Figure 9.9 illustrates the signa 
points in N = 2 and N = 3 dimensions. 

The M signals constructed in this manner have equal energy s. We determine thi 
cross-correlation coefficient between any pair of signals based on the method used t 
select the M signals from the 2” possible signals. This topic is treated in Chapter 13 
We can show that any adjacent signal points have a cross-correlation coefficient of (se: 
Problem 9.2) 


y = — (9.4.4 


and a corresponding Euclidean distance 


d=2/€,/N. (9.4.5 


The distance between adjacent signal points affects the performance of the detector; as w 
have observed previously. 
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pt) |. 





Figure 9.9 Signal points for signals generated from binary codes. 


9.4.1 Probability of Error for Binary-Coded Signals 


As we have observed in the discussion above, a set of N-dimensional binary code words 
of the form 

Cm = (Cm1, Cm2, ---, CnN), m=1,2,...,M (9.4.6) 
are represented geometrically in vector form as 


Sm = (Sml, Sm2, ---, SmN), m=1,2,...,M, (9.4.7) 


where Smj = +,/6,;/N forall m and j, and ©, is the energy per waveform. The error prob- 
ability of a coded system can be obtained using the union bound given in Equation (8.4.42). 
The performance of a digital communication system that employs binary-coded signals is 
considered in Chapter 13. 





9.5 FREQUENCY-SHIFT KEYING 


The simplest form of frequency modulation for digital transmission is binary frequency- 
shift keying (FSK) previously described in Section 8.2.2. In binary FSK, we employ two 
different frequencies, such as fọ and fj = fo + Af, to transmit a binary information 
sequence. The choice of frequency separation AF = fı — fo is considered shortly. Thus, 
the two signal waveforms may be expressed as 
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26, 

uo(t) = ,/—cos2nfot, 0<t <T, 
T, 
{26 

u(t) = cos 2a fit, 0<t<h, 
b 


where &, is the signal energy/bit and T, is the duration of the bit interval. 
More generally, M-ary FSK may be used to transmit a block ofk = log, M bits/signg 
waveform. In this case, the M signal waveforms may be expressed as 


(9.5.1) 





2% 
Um(t) = 4f 7 Cosa fet + 2nmAft), m=0,1,....M—1, O<t<T, (9.5.2 


where €s = kG» is the energy per symbol, T = kT, is the symbol interval, and Af i; 
the frequency separation between successive frequencies, i.e., Af = fm — fm-1, where 
fn = fe + mAf. 

Note that the M FSK waveforms have equal energy és. The frequency separation Aj 
determines the degree to which we can discriminate among the M possible transmitted sig 
nals. As a measure of the similarity (or dissimilarity) between a pair of signal waveforms 
we define the correlation coefficients 


T 
ee Í E E ET (9.53 
Es 0 


Substituting for um (t) and un(t) in Equation (9.5.3), we obtain 


1 [T28 
Ymn = z Í a cos(2nf,t + 2nmA ft) cos(2n fet + 2unAft) dt 
s JO 


1 øT Lf? 
=7 Í cos2n(m —n)Aftdt + T Í cos[4n fet + 2n(m +n)Aft]dt 
0 0 


= sin 2u(m — n)AfT 


= Gade (9.5.4 


where the second integral vanishes when fe >> 1/T. A plot of Ym, as a function of th 
frequency separation Af is given in Figure 9.10. We observe that the signal waveform 
are orthogonal when Af is a multiple of 1/27. Hence, the minimum frequency separatio 
between successive frequencies for orthogonality is 1/2T. We also note that the mim 
mum value of the correlation coefficient is Yn, = —0.217, which occurs at the frequenc 
separation Af = 0.715/T. 
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Ymn 


Af Figure 9.10 Cross-correlation 
coefficient as a function of frequency 
separation for FSK signals. 





M-ary orthogonal FSK waveforms have a geometric representation as M@-dimensional 
orthogonal vectors, given as 


so = (Vs, 0,0,...,0), 
sı = (0, V;,0,...,0), 


(9.5.5) 


su-1 = (0,0,....0, v8), 


where the basis functions are Ym (t) = ./2/T cos 2n(f, + mAf)t. The distance between 
the pairs of signal vectors is d = ./2%, for all m,n, which is also the minimum distance 
among the M signals. 


9.5.1 Demodulation of M-ary FSK 


Assume that the FSK signals are transmitted through an additive white Gaussian noise 
channel. Furthermore, we assume that each signal is phase shifted in the transmission 
through the channel. Consequently, the filtered received signal at the input to the demodu- 
lator may be expressed as 





r(t) = y á cos(2n fet + 2nmAft + dm) +n(t), (9.5.6) 


where ¢,, denotes the phase shift of the mth signal and n(t) represents the additive band- 
pass noise, which may be expressed as 


n(t) = n-(t) cos 27 fet — ns(t) sin 27 fet. (9.5.7) 


The demodulation and detection of the M FSK signals may be accomplished by one 
of two methods. One approach is to estimate the M carrier phase shifts {pm} and perform 
phase-coherent demodulation and detection. As an alternative method, the carrier phases 
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may beignored in the demodulation and detection of the FSK signals. The latter method is 
called noncoherent demodulation and detection. 

In phase-coherent demodulation, the received signal r(t) is correlated with each of 
the M possible received signals cos(2n fet +2nmAft +m) m=0,1,..., M— 1, where 
{bn} are the carrier phase estimates. A block diagram illustrating this type of demodulatior 
is shown in Figure 9.11. It is interesting to note that when bn E m,m =0,1,...,M =] 
(imperfect phase estimates), the frequency separation required for signal orthogonality a 
the demodulator is Af = 1/T (see Problem 9.10), which is twice the minimum separatior 
for orthogonality when @ = ĝ. 

The requirement for estimating M carrier phases makes coherent demodulation o; 
FSK signals extremely complex and impractical, especially when the number of signals i: 
large. Therefore, we shall not consider the coherent detection of FSK signals. 

Instead, we now consider a method for demodulation and detection that does no 
require knowledge of the carrier phases. The demodulation may be accomplished as show: 
in Figure 9.12. In this case, there are two correlators per signal waveform, or a total of 2M 
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Figure 9.11 Phase-coherent demodulation of M-ary FSK signals. 
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Figure 9.12 Demodulation of M-ary FSK signals for noncoherent detection. 


correlators, in general. The received signal is correlated with the basis functions (quadra- 
ture carriers) 


Wem (t) = [Z osons +2nmAft) 
and 


Wsm(t) = ae sin(2n fet + 2nmA ft) 


form =0,1,..., M — 1.The 2M outputs of the correlators are sampled at the end of the 
signal interval and the 2M samples are passed to the detector. Thus, if the mth signal is 
transmitted, the 2M samples at the detector may be expressed as 
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sin 2n(k —m)AfT cos2u(k —m)AfT —1 . 
=V 6s | ——— oe 
eee | N Sa ama Aa 
cos 21(k —m)AfT -1 sin2n(k —m)AfT . 
=> 6 ee + —— M 
oS | RAPT, akea a R; 
(9.5.8 
where ngc and ngs denote the Gaussian noise components in the sampled outputs. 
We observe that when k = m, the sampled values to the detector are 
Yme = NV És COS Ọm + Amc» 
(9.5.5 








Yms = vV Ês sin Ọm + "ms. 





Furthermore, we observe that when k Æ m, the signal components in the samples y,, an 
Ygs Will vanish, independent of the values of the phase shift ¢;,, provided that the frequenc 
separation between successive frequencies is Af = 1/T. In such a case, the other 2(M —1 
correlator outputs consist of noise only, i.e., 


Yke = Akce» Yks = Nks, k zx m. (9.5.1¢ 








In the development that follows, we assume that Af = 1/T, so that the signals a 
orthogonal. 

It is easily shown (see Problem 9.11) that the 2M noise samples {nke} and {n,, 
are zero-mean, mutually uncorrelated Gaussian random variables with an equal varianc 
o? = No/2. Consequently, the joint PDF for Yme and Yms conditioned on $m is 














1 -[Ome—v bs cos om)? +Oms -N Os sing)? | /20? 


—~ 5 9.5.1. 
2no2 ( 


Sin (Vines Yms lom) = 

















and for m Æ k, we have 





fv, Okc» Yes) = okt.) / a (9.5.1 


2no2 


Given the 2M observed random variables {Yke, Yks, k = 0, 1,..., M — 1), the opt 
mum detector selects the signal that corresponds to the maximum of the posteri 


probabilities 
P[sm was transmitted |y] = P (Smly), m =0,1,..., M — 1, (9.5.1. 
where y is the 2M-dimensional vector with elements {yx-, Yks, k =0,1,..., M — 1}. V 


derive the form for the optimum noncoherent detector for the case of binary FSK. TI 
generalization to M-ary FSK is straightforward. 
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9.5.2 Optimum Detector for Noncoherent Binary FSK 


In binary orthogonal FSK, the two posterior probabilities are 


fy (yls0)P (So) 


P(soly) = 
ad FO) 
(9.5.14) 
fy (yls1)P(S1) 
P(s = =; 
S EO 
hence, the optimum detection rule may be expressed as 
50 
P (sol y)  P(sily) (9.5.15) 
s1 
or, equivalently, 
P 5 
fr(ylso)P (So) 2 fr(ylsi)P(s1) (9.5.16) 


fry sa fry ° 


where y is the four-dimensional vector y = (Yoc, Yos, Yic» Yis)- The relation in Equation 
(9.5.16) simplifies to the detection rule 
fy (ylso) XV P(si) 
<= . 
fy (yls1) sı P(So) 


The ratio of PDFs in the left-hand side of Equation (9.5.17) is the likelihood ratio, which 
we denote as 





(9.5.17) 


fr (yls0) 
fr lsi) 
The right-hand side of Equation (9.5.17) is the ratio of the two prior probabilities, which 


takes the value of unity when the two signals are equally probable. 
The PDFs fy (yl|so) and fy(y|sı) in the likelihood ratio may be expressed as 


A(y) = (9.5.18) 


2x 


Jels) = fr, Oic, Yis) i JY Yoc» Yos |G) fo (bo)d ho, 


(9.5.19) 
2x 


Fr (yls= fry oc, Yos) ‘ Fr, ie Visio.) fo(oid¢r, 


where fY,,(Ymcs Ymslm) and fy, (Yke, Yks) m Æ k are given by Equations (9.5.11) and 
(9.5.12), respectively. Thus, the carrier phases øo and ¢ are eliminated by simply averag- 
ing fYm (Yme, Ymslbm)- 

The uniform PDF for $m represents the most ignorance regarding the phases of the 
carriers. This is called the least favorable PDF for om. When fo,, (dm) = 1/21, 0 < Ọm < 
2x, is substituted into the integrals given in Equation (9.5.19), we obtain 
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1 2x 
Aa F¥m Yes Yms lm) dm 


20 0 
j oer Se are) 2 1 2x i 
= T (v2ety2s+6s)/20 =f oV bs (ime 008 bm+yms sin dm) / 0? don. (9.5.20) 
But 
1 27n lee n 2 Ss (y2 + y2 ) 
ea eV Om eosin tration) o? ag, = p(X" | asa 


where Jo(x) is the modified Bessel function of order zero. This function is a monotonically 
increasing function of its argument, as illustrated in Figure 9.13. Jo(x) has the power series 
expansion 


oo 2k 


x 


From Equations (9.5.17) to (9.5.19), we obtain the likelihood ratio in the form 


A(y) = < . 
h (S8 (v8. +3) 707) 7 POO) 





(9.5.23) 


Thus, the optimum detector computes the two envelopes 


Yo = y Yoo + Ys 


Figure 9.13 Graph of Jp(x). 
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n= Vit Vs 


and the corresponding values of the Bessel function 


(Var) 
(Vir) 


to form the likelihood ratio. We observe that this computation requires knowledge of the 
noise variance g? and the signal energy €s. The likelihood ratio is then compared with the 
threshold P (s1)/P (so) to determine which signal was transmitted. 

A significant simplification in the implementation of the optimum detector occurs 
when the two signals are equally probable. In such a case, the threshold becomes unity and, 
due to the monotonicity of the Bessel function, the optimum detector rule simplifies to 


50 
VY + Vos žy y + y2. (9.5.24) 


Thus, the optimum detector bases its decision on the two envelopes 


yo = y ve + Ns 
y= y ty; 


hence, it is called an envelope detector. 

The computation of the envelopes of the received signal samples at the demodulator’s 
output renders the carrier signal phases {@,,} irrelevant in the decision as to which signal 
was transmitted. Equivalently, the decision may be based on the computation of the squared 
envelopes yo and ye; in this case, the detector is called a square-law detector. Figure 9.14 
shows the block diagram of the demodulator and the square-law detector. 

The generalization of the optimum demodulator and detector to M-ary orthogo- 
nal FSK signals is straightforward. As illustrated in Figure 9.12, the output of the opti- 
mum demodulator at the sampling instant consists of the 2M vector components (Yoc, Yos, 
Vie, Yis, +++» YM—1c; YM—1is). Then the optimum noncoherent detector computes the M 


envelopes as 
Ym = ¥z-+y2;, m=0,1,...,.M—1. (9.5.25) 


Thus, the unknown carrier phases of the received signals are rendered irrelevant to the 
decision as to which signal was transmitted. When all the M signals are equally likely to be 
transmitted, the optimum detector selects the signal corresponding to the largest envelope 
(or squared envelope). In the case of nonequally probable transmitted signals, the optimum 


and 


and 


and 
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Figure 9.14 Demodulation and square-law detection of binary FSK signals. 


detector must compute the M posterior probabilities in Equation (9.5.13) and then select 
the signal corresponding to the largest posterior probability. 


9.5.3 Probability of Error for Noncoherent Detection of M-ary FSK 


Consider M-ary orthogonal FSK signals that are detected noncoherently. We assume that 
the M signals are equally probable a priori and that uo(t) was transmitted in the interval 
O<t<T. 

The M-decision metrics at the detector are the M envelopes 


Ym = fy? + yng m=0,1,...,M—1, (9.5.26 


where 
Yoc = VE; cos ho + noc, 
(9.5.27, 
Yos = VE; sino + Nos 
and 
Yme = Amec, M=1,2,...,M — 1, 
mg, i (9.5.28; 


Yms = nms, Mm = 1,2,..., M — 1. 


The additive noise components {nme} and {nms } are mutually statistically independent zero- 
mean Gaussian variables with equal variance o? = No /2. Thus, the PDF’s of the random 
variables at the input to the detector are 
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g (y + yp.) 
1 ee ee are: Bs 2 s \ Yo, 0. 
Fro ees You) = ze (odct98,+8s) /20? 7, es (9.5.29) 





Be Ee eee 2 
fen Ome, Yms) = e (het ys) 2 = Ie eRe M —1. (9.5.30) 


2mo? 


Let us make a change in variables in the joint PDFs given by Equations (9.5.29) and 
(9.5.30). We define the normalized variables 





R SEn Yoz + Fas 
m = ’ 

y (9.5.31) 
On = tan! = 


mc 


Clearly, Yme = o R m cos © m and Yms = o Rm sin Om. The Jacobian of this transformation is 





_ | 0 COs Om o sin Onm _ 2 
I= —o Rp sin Om o Rm COS Om | I Rm. (9.5.32) 
Consequently, 
ro 2 2€ 
free = Ze (34265/No)/2 J, a” ‘ (9.5.33) 
Somon (Tm, Om) = ae m=1,2,...,M—1. (9.5.34) 


Finally, by averaging fr,,@,(%m> Om) over Om, the factor of 21 is (see Example 5.1.11 and 
Problem 5.30) eliminated from Equations (9.5.33) and (9.5.34). Thus, we find that Ro has 
a Rice probability distribution and Rm, m = 1,2, ..., M — 1 are each Rayleigh distributed. 

The probability of a correct decision is simply the probability that Ro > Ri, Ro > 
Ro, ..., and Ro > Ry_1. Hence, 


P= P(R, < Ro, R2 < Ro, we, RM1 < Ro) 


00 (9.5.35) 
= it P(Rı < Ro, Ro < Ro, ..., Ru-1 < Ro|Ro = x) fro (x) dx. 
0 


Because the random variables Rm, m = 1,2,..., M — 1 are statistically i.i.d., the joint 


probability in Equation (9.5.35) conditioned on Ro factors into a product of M — 1 identical 
terms. Thus, 


P, = Í [P(R1 < RolRo = x)I™~" fro (x) dx, (9.5.36) 
0 
where 
P(R < Rol Ro = x) = f fri (rı)drı 
0 


=1- e”, (9.5.37) 























a 
















































































es 



































512 Multidimensional Digital Modulation Chapter ç 


The (M — 1)st power of Equation (9.5.37) may be expressed as 


_) M-I = 
[isee] — reo eae (9.5.38) 
n=0 


Substitution of this result in Equation (9.5.36) and integration over x yields the probability 
of a correct decision as 


M—i 

M-1 1 

Pe = > cne( i ) o (9.5.39 
=0 


where ps = @;/No is the SNR/symbol. Then, the probability of a symbol error, which is 
Py = 1 — P,, becomes 


M M-1\ 1 
Py = —1 ntl = = eben) 5.40 
m 2 ) ( 5 Jr (9.5.40 


where pp = ,/No is the SNR/bit. 
For binary FSK (M = 2), Equation (9.5.40) reduces to the simple form 


1 
P = Rous: (9.5.41 
For M > 2, we may compute the probability of a bit error by using the relationship 


Qk-1 

P, = ——— Py, 9.5.42 
b= > Pu ( 
which was established in Section 9.1.1. Figure 9.15 shows the bit-error probability as func- 
tion of the SNR/bit pp for M = 2, 4, 8, 16, and 32. Just as in the case of coherent detectior 
of M-ary orthogonal signals (see Section 9.1.1), we observe that for any given bit-erro: 
probability, the SNR/bit decreases as M increases. It will be shown in Section 12.6 that, ir 
the limit as M — oo (or k = logy M — oo), the probability of a bit-error P, can be mad 

arbitrarily small provided that the SNR/bit is greater than the Shannon limit of —1.6 dB. 


Example 9.5.1 


Compare the error probability of binary FSK, which is given by Equation (9.5.41), with tha 
for binary differential PSK (DPSK), which is given by Equation (8.6.42). 


Solution The error probability for binary DPSK is 


1. 
f= 5° Rb: 


Comparing this expression with that for binary FSK, we observe:that binary FSE 
requires twice the transmitted signal energy to achieve the same performance as binar: 
DPSK. |] 
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9.6 MODULATION SYSTEMS WITH MEMORY 














All modulation systems studied so far are memoryless, meaning that the transmitted signal 
in any signaling interval depends only on the current information that is to be transmitted 
and does not depend on the information transmitted during past signaling intervals. In 
this section, we study modulation systems with memory. In these systems, the transmitted 
signal in each interval depends on the current information as well as previously transmitted - 
information. We will see that these types of modulation schemes usually provide higher 
bandwidth efficiency and the inherent memory in these modulation schemes can be utilized 
to achieve better performance. 


9.6.1 Continuous-Phase FSK 
Ordinary FSK signals may be generated by having M = 2* separate oscillators tuned to the 


desired frequencies fe + mAf = fm and selecting one of the M frequencies according to 
the particular k-bit symbol that is to be transmitted in a signal interval. However, abruptly 
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switching from one oscillator output to another in successive signaling intervals results jn 
relatively large spectral sidelobes outside of the main spectral band of the signal, which 
decay slowly with frequency separation. Consequently, this method wastes bandwidth. 

To avoid the use of signals with large spectral sidelobes, we may use the information- 
bearing signal to frequency modulate a single carrier whose phase is changed in a contin- 
uous manner. The resulting frequency-modulated signal is phase continuous; hence, it js 
called continuous- phase FSK (CPFSK). 

In order to represent a CPFSK signal, we begin with a PAM signal 


v(t) = È angr(t — nT), (96.1) 


where the amplitudes are obtained by mapping k-bit blocks of binary digits from the infor- 
mation sequence into the amplitude levels +1, +3, ..., +(M — 1), and gr (t) is a rectangu- 
lar pulse of amplitude 1/2T and duration T. The signal v(t) is used to frequency modulate 
the carrier. Consequently, the frequency-modulated carrier is 


u(t) =j = cos EZ + anTfa | vende (9.6.2) 


where fy is the peak-frequency derivation. Note that the instantaneous frequency of the 
carrier is fe + 2T fgv(t). 

We observe that, although v(t) contains discontinuities, the integral of u(t) is con- 
tinuous. We may denote the phase of the carrier as 


t 
O(t;a) = anti | v(t)dt, (9.6.3: 
= 
where a denotes the sequence of signal amplitudes. Since 0 (t; a) is a continuous functior 
of t, we have a continuous-phase signal. 
The phase of the carrier in the interval nT < t < (n + 1)T is determined by the 
integral in Equation (9.6.3). Thus, 


n-1 

O(t;a) =2nfaT 2 ay + 2n(t — nT) faan Cre 

=, + 2mha,g(t —nT), 

where h, 6,, and q(t) are defined as 
h=2faT, (9.6.5 
n—1 
On = mh Y. ay, (9.6.6 
k=—0O 
0, t<0 

q(t)= { t/2T, O<t <T. (9.6.7 


1 
7’ t>T 
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8r) q(t) 
1 

2 f 

1 

1 

a. 

2T | 

L] 

i Figure 9.16 The signal plot 8r (t) 
0 T t 0 T t 


and its integral q(t). 


The parameter h is called the modulation index. We observe that 6, represents the phase 
accumulation (memory) from all symbols up to time (n — 1)T. The signal q(t) is simply 
the integral of the rectangular pulse, as illuswated in Figure 9.16. 

It is instructive to sketch the set of all phase wajectories 0(t; a) generated by all 
possible values of the information sequence {a,}. For example, with binary symbols, a, = 
+1, the set of phase trajectories beginning at time t = 0 is shown in Figure 9.17. For 
comparison, the phase trajectories for quaternary CPFSK (a, = +1, +3) are illustrated in 
Figure 9.18. These phase diagrams are called phase trees. We observe that the phase trees 
are piecewise linear because the pulse gr(t) is rectangular. Smoother phase trajectories 
and phase trees may be obtained by using pulses that do not contain discontinuities. 

The phase trees shown in these figures grow with time. However, the phase of the 
carrier is unique only in the range 0 = 0 to 0 = 2x or, equivalently, from 6 = —x to 
0 = x. When the phase trajectories are plotted modulo 21, e.g., in the range (—1, 11), the 
phase tree collapses into a structure called a phase trellis. 

Simpler representations for the phase trajectories can be obtained by displaying only 
the terminal values of the signal phase at the time instants t = nT. In this case, we restrict 
the modulation index h to be rational. In particular, let us assume that h = m/ p, where m 
and p are relatively prime integers. Then, at the time instants t = nT, the terminal phase 
states for m even are 


2 —1 
Bee fo, a OME, i com (9.6.8) 

p p Pp 

and for m odd are 5 2 i 
as = fo, zm mn. Cez Dn). (9.6.9) 

Pp p p 


Hence, there are p terminal phase states when m is even and 2p terminal phase states 
when m is odd. For example, binary CPFSK with h = 1/2 has four terminal phase states, 
namely, 0, 1/2, 1, 31/2. The state trellis for this signal is illustrated in Figure 9.19. In the 
state trellis, the phase transitions from one state to another are not true phase trajectories. 
They represent phase transitions to the terminal states at the time instants t = nT. 

An alternative representation to the state trellis is the state diagram, which also illus- 
trates the state transitions at the time instants £ = nT. This is an even more compact 
representation of the CPFSK signal. Only the possible terminal phase states and their tran- 
sitions are displayed in the state diagram. Time does not appear explicitly as a variable. For 
example, the state diagram for the CPFSK signal with h = 1/2 is shown in Figure 9.20. 
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Figure 9.17 Phase trajectory for binary CPFSK. 


We should emphasize that a CPFSK signal cannot be represented by discrete point: 
in signal space as in the case of PAM, PSK, and quadrature amplitude modulation (QAM) 
because the phase of the carrier is time variant. Instead, the constant amplitude CPFSK sig 
nal may be represented in two-dimensional space by a circle, where points on the circle rep 
resent the combined amplitude and phase trajectory of the carrier as a function of time. Fo 
example, Figure 9.21 illustrates the signal-space diagrams for binary CPFSK with h = 1/: 
and h = 1/4. The dots at 0 = 0, 1/2, x, 3n/2 and 0 = 0, tn/4, tn/2, +31/4, x fo: 
h = 1/2 and h = 1/4, respectively, represent the terminal phase states previously show: 
in the state diagram. 


Minimum-Shift Keying. Minimum-shift keying (MSK) is a special form o 
binary CPFSK in which the modulation index h = 1/2. Thus, the phase of the carrie 
for the MSK signal is 
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2T IE 4T 


g. Figure 9.20 State diagram for binary CPFSK with h = 1/2. 


(a) (b) 


The expression in Equation (9.6.11) indicates that the MSK (binary CPFSK) signi 
is basically a sinusoid consisting of one of two possible frequencies in the interval nT; : 
t < (n + 1)Tp, namely, 


Figure 9.21 Signal space diagram for binary CPFSK 
with (a) h = 1/2 and (b) h = 1/4. 


1 
fi=se- tr 


1 (9.6.12 
fro=fet+ T 
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Hence, the two sinusoidal signals may be expressed as 


2% 
u(t) = E cos [an fit + 0a +=], i=1,2 (9.6.13) 
T, 7 


The frequency separation is Af = f2 — fı = 1/2Tp. Recall that this is the minimum fre- 
quency separation for orthogonality of the two sinusoids, provided the signals are detected 
coherently. This explains why binary CPFSK with h = 1/2 is called minimum-shift keying. 
Note that the phase of the carrier in the nth signaling interval is the phase state of the signal 
that results in phase continuity between adjacent intervals. It is interesting to demonstrate 
that MSK is also a form of four-phase PSK. To prove this point, we begin with a four-phase 
PSK signal, which has the form 


u(t) = i | l D An 87 (t — zm | cos 2n fet 


Tp n=—00 

(9.6.14) 

foe) 

+ | 5 an+18r(t — 2nTp — m snafu : 
n=—00 
where gy (t) is a sinusoidal pulse defined as 
sin- 0 <1 <2%; 
2T,, 
gr(t) = (9.6.15) 


0, otherwise 


and illustrated in Figure 9.22. First, we observe that the four-phase PSK signal consists 
of two quadrature carriers, cos 27x f,t and sin 27x f,t, which are amplitude modulated at a 
rate of one bit per 2T, interval. The even-numbered information bits {a2} are transmitted 
by modulating the cosine carrier, while the odd-numbered information bits {a2,4;} are 
transmitted by amplitude modulating the sine carrier. Note that the modulation of the two 
quadrature carriers is staggered in time by T, and that the transmission rate for each carrier 
is 1/2T,. This type of four-phase modulation is called offset quadrature PSK (OQPSK) or 
staggered quadrature PSK (SQPSK). 


gr(t) = sin nt/2T, | 





Figure 9.22 Sinusoidal pulse shape. 




























































































520 Multidimensional Digital Modulation Chapter 9 


Figure 9.23 Representation of an MSK signal as a 

form of two staggered binary PSK signals, each with 
a sinusoidal envelope. (a) In-phase signal component, 
(b) quadrature signal component, and (c) MSK signal 





O T 2T 3T 4T ST 6T TMT 8T 
(c) (a+b). 


Figure 9.23 illustrates the SQPSK signal in terms of the two staggered quadrature. 
modulated binary PSK signals. The corresponding sum of the two quadrature signals is < 
constant-amplitude, continuous-phase FSK signal, as shown in Figure 9.23. 

Itis also interesting to compare the waveforms for MSK with the waveforms for stag. 
gered quadrature PSK (QPSK), in which the pulse gr(t) is rectangular for 0 < t < 27% 
and with the waveforms for conventional QPSK, in which the baseband pulse is rectangu- 
lar in the interval 0 < t < 2Tẹ. We emphasize that all three of these modulation method: 
result in identical data rates. The MSK signal is phase continuous. The SQPSK signal witt 
a rectangular baseband pulse is basically two binary PSK signals for which the phase tran 
sitions are staggered in time by T, seconds. Consequently, this signal contains phase jump: 
of +90° that may occur as often as every Tp seconds. On the other hand, in conventiona 
QPSK with constant envelope, one or both of the information symbols may cause phas 
transitions as often as every 2T, seconds. These phase jumps may be +180° or +90°. Ai 
illustration of these three types of four-phase PSK signals is shown in Figure 9.24. 

From this description, it is clear that CPFSK is a modulation method with memory 
The memory results from the phase continuity of the transmitted carrier phase from on: 
symbol to the next. As a consequence of the continuous-phase characteristics, the powe 
spectra of CPFSK signals are narrower than the corresponding FSK signals in which th 
phase is allowed to change abruptly at the beginning of each symbol. 


Continuous-Phase Modulation. When the phase of the carrier is expresse 
in the form of Equation (9.6.4), CPFSK becomes a special case of a general class o 
continuous-phase modulated signals in which the carrier phase is 
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\ 





(a) 





4T, 


—90° phase shift +90° phase shift 





+90° phase shift 
No data transitions 





—90° phase shift 


Figure 9.24 Signal waveforms for (a) MSK, (b) offset QPSK (rectangular pulse), and (c) conventional QPSK 
(rectangular pulse). (From Gronemeyer and McBride; ©1976 IEEE.) 


O(t;a) =2n DY) ahg(t—kT), nT <t<(n+DT, (9.6.16) 


k=—00 


where {ax} is the sequence of M-ary information symbols with possible values +1, +3, ..., 
+(M — 1), and q(t) is some arbitrary normalized waveform. Recall that for CPFSK, 
q(t) =t/2T for0 <t < T, q(t) =Ofort < 0, and q(t) = 1/2 fort >T. 

The waveform q (t) is the integral of a pulse gr (t) of arbitrary shape, i.e., 


q(t) = [ 8r(t)dt. (9.6.17) 


If gr(t) = 0 for t > T, the CPM signal is called a full-response CPM signal. If the signal 
pulse gr (t) is nonzero for t > T, the modulated signal is called partial response CPM. In 
Figure 9.25, we illustrate several pulse shapes for gr (t) and the corresponding q (t). It is 
apparent that there is an infinite number of CPM signals that can be obtained by selecting 
different pulse shapes for gr (t) and by varying the modulation index h and the number of 
symbols M. 

The primary reason for extending the duration of the pulse g(t) beyond the time 
interval 0 < t < T is to further reduce the bandwidth of the transmitted signal. We note 
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that, when the duration of the pulse gr(t) extends over the time interval 0 < t < LT, 
where L > 1, additional memory is introduced in the CPM signals; hence, the number of 
phase states increases. 

Three popular pulse shapes are given in Table 9.1. LREC denotes a rectangular pulse 
of duration LT, where L is a positive integer. In this case, L = 1 results in a CPFSK 
signal with the pulse, as shown in Figure 9.25(a). The LREC pulse for L = 2 is shown 
in Figure 9.25(c). LRC denotes a raised cosine pulse of duration LT. The LRC pulses 
corresponding to L = 1 and L = 2 are shown in Figures 9.25(b) and (d), respectively. 

The last pulse given in Table 9.1 is a Gaussian minimum-shift keying (GMSK) pulse 
with the bandwidth parameter B, which represents the —3 dB bandwidth of the Gaussian 
pulse. Figure 9.25(e) illustrates a set of GMSK pulses with time-bandwidth products BT 
ranging from 0.1 to 1. We observe that the pulse duration increases as the bandwidth of 
the pulse decreases, as expected. In practical applications, the pulse is usually truncated 
to some specified fixed duration. GMSK with BT = 0.3 is used in the European digital 
cellular communication system, called GSM. From Figure 9.25(e), we observe that when 
BT = 0.3, the GMSK pulse may be truncated at |t| = 1.5T with a relatively small error 
incurred fort > 1.5T. 

Due to the phase continuity inherent in the CPFSK and CPM signals, these modula- 
tion schemes have memory, thus, their demodulation and detection is much more complex 
from a computational viewpoint. In general, the demodulator cross correlates the received 
signals in each signals interval with each of the possible phase-modulated transmitted sig- 
nals and passes the correlation metrics to the maximum-likelihood (ML) detector. The 
detector exploits the memory (phase continuity) in the transmitted signal by performing 
ML sequence detection, as opposed to ML symbol-by-symbol detection, using the correla- 
tion metrics in each signal interval. In Chapter 13, we introduce a computationally efficient 
method, called the Vitebi algorithm, for performing ML sequence detection. 


TABLE 9.1 SOME COMMONLY USED CPM PULSE SHAPES 


LREC 
1 
— (O<t<LT) 
2 
8r (t) = LI: 
0, otherwise 
LRC 
1 27t 
a z paea 0< T 
ZET (1 cos), <t<L 
&r(t) = 
0, otherwise 
GMSK 


gr) = [O [2x8 (e - 7) /dn2y"?] - Q [278 (t + F) /0n2y'?)| 


1 © ie 2p 
= x d. 
o0) 2n f Š 3 










































































524 Multidimensional Digital Modulation Chapter c 
9.6.2 Spectral Characteristics of CPFSK Signals 


In this section we consider the spectral characteristics of CPFSK signals and present some 
results on their power spectral density. The derivation of these results may be found jy 
more advanced digital communication textbooks, e.g. Anderson et al. [1986] and Proakis 
and Salehi [2008]. 

A CPFSK signal may be expressed in the general form 


[26 

u(t; a) = a 
= Re | [2@s a) 

T z 


where the carrier phase 0 (t; a) is given by Equation (9.6.16). The lowpass equivalent o; 
this signal is the complex-valued baseband signal 


u(t) =,/ re jolta), (9.6.19 


which is sufficient to focus our attention on the spectral characteristics of the information 
bearing signal v(t). 

The computation of the power spectral density of the CPFSK (or CPM) signal i: 
somewhat difficult and tedious. The difficulty is due to the memory in the CPFSK signa 
and the exponential relationship between v(t) and 0 (t; a). The general procedure involve; 
the computation of the autocorrelation function of the baseband signal v(t) and, then, com 
puting the Fourier transform of the autocorrelation function to obtain the power spectra 
density Sy (f). 

The power spectral density of the real-valued signal u(t; a) is then found by translat 
ing Sy(f) in frequency by the carrier f,. Thus, we obtain 


t+0(t;a)] 








(9.6.18 


1 
Su(f) = gE — fe) + Sv(F + fo). (9.6.20 


In the case of the CPFSK signal, this procedure yields the following expression fo 
the power spectral density: 


Sy(f) ~70,/ 53 DAO ADI Bum(f) An aad | (9.6.21 


n=1 m=1 
where 


sin n[ fT — (Qn — 1 — M)h/2) 
x [fT —(2n—1—M)h/2] 


l h(2n —1 — M) 
= sine (gr - 121 ). 


An(f) = (9.6.22 
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COs (2nfT = Anm) = B COS nm 





Bum(f) = —| + pi—2BpcosinfT (9.6.23) 
Onm = TA(m +n — 1 — M), (9.6.24) 
_ sin Mah (9.6.25) 

~ Msinmh’ oH 


The power spectral density of CPFSK for M = 2 is plotted in Figure 9.26 as a func- 
tion of the normalized frequency fT, withthe modulation index h = 2 f4T as a parameter. 
Note that only one-half of the spectrum is shown in these graphs, because the spectrum is 
symmetric in frequency. The origin fT = 0 corresponds to the carrier frequency fe in the 
spectrum of the real-valued signal. 

These graphs show that the spectrum of the CPFSK signal is relatively smooth and 
well confined for h < 1. As h approaches unity, the spectra become very peaked and, for 
h = 1, where |6| = 1, we find that impulses occur at M frequencies, which is a situation 
that is generally avoided in practice. In communication systems that employ CPFSK, the 
modulation index is selected to conserve bandwidth, so that h < 1. 

The special case of binary CPFSK with h = 1/2 (or fy = 1/4T,) and £ = 0 corre- 
sponds to MSK. In this case, the power spectral density obtained from Equations (9.6.21) 
through (9.6.24) is 





Sy(f) = (9.6.26) 


m2 


326; l cos2nfT, 1? 
1—16 fT? 


In contrast, the power density spectrum of SQPSK with a rectangular pulse gr (t) of dura- 
tion 2T, is ; 
in2n fT; 

wa) . (9.6.27) 


2x fT, 


The power density spectra in Equations (9.6.26) and (9.6.27) are illustrated in 
Figure 9.27. Note that the main lobe of MSK is 50% wider than that of SQPSK. However, 
the sidelobes of MSK fall off considerably faster. As a consequence, MSK is significantly 
more bandwidth efficient than SQPSK. 

In the more general case of CPM signals, the use of smooth pulses such as raised 
cosine pulses (LRC) of the form given in Table 9.1, where L = 1 for full response and 
L > 1 for partial response, result in smaller bandwidth occupancy and, hence, in greater 
bandwidth efficiency than the use of rectangular pulses. For example, Figure 9.28 illus- 
trates the power-spectral density for binary CPM with different partial-response raised 
cosine (LRC) pulses and h = 1/2. For comparison, the spectrum of binary CPFSK with 
h = 1/2 (MSK) is also shown. We note that as L increases, the pulse gy (t) becomes 
smoother and the corresponding spectral occupancy of the signal is reduced. 


Sv(f) = 48, ( 





9.7 COMPARISON OF MODULATION METHODS 


The digital modulation methods described in this chapter and Chapter 8 can be compared 
in a number of ways. For example, we can compare them on the basis of the SNR required 
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Figure 9.27 Power density spectra of MSK and QPSK. (From Gronemeyer and McBride; © 1976 IEEE.) 
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Figure 9.28 Power spectral density for binary CPM 
with h = 1/2 and different pulse shapes. (From Aulin 
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to achieve a specified probability of error. However, such a comparison would not be ve: 
meaningful unless it were made on the basis of some constraint, such as a fixed data ra 
of transmission. 

Suppose that the bit rate R, is fixed. Consider the channel bandwidth required 
transmit the various signals. If we employ M-ary PAM, where M = 2*, the channel ban: 
width required to transmit the signal is simply the bandwidth of the signal pulse gr (1 
which depends on its detailed characteristics. For our purposes, we assume that g7 (t) 
a pulse of duration T and that its bandwidth W is approximately 1/27, where T is tl 
symbol interval. In one symbol interval, we can transmit k information bits, so T = k /ì 
sec. Hence, the channel bandwidth required to transmit the M-ary PAM signal is 


W = R,/2k = R,/2log, M Hz. (9.7. 


If the PAM signal is transmitted at bandpass as a double-sideband suppressed carri 
signal, the required channel bandwidth is twice that for the baseband channel. However, t) 
bandwidth of the bandpass PAM signal can be reduced by a factor of 2 by transmitting on 
one of the sidebands (either the upper or the lower sideband of the bandpass signal). Tht 
the required channel bandwidth of the single-sideband bandpass PAM signal is exactly t 
same as the bandwidth of the baseband signal. 

In the case of QAM, the channel bandwidth is (approximately) W = 1/T, but sin 
the information is carried on two quadrature carriers, it follows that T = 2k/ R», where 
is the number of information bits/carrier. Hence, 


W= R,/2k = R,/2log, Mpam 


9.7. 
= R,/ logy Maan, ( 


where the number of signals for M-ary QAM, denoted as Mgam, is equal to the square 
the number Mpam of PAM signals. 

For M-ary phase modulation (M > 2), the channel bandwidth required to transr 
the multiphase signals is W = 1/T, where T = k/ Rp. Hence, 


W = R,/k = R,/ log, M. (9.7 


Note that PAM, QAM, and PSK signals have the characteristic that, for a fixed 
rate Rp, the channel bandwidth decreases as the number of signals M increases. This mea 
that, with increasing M, the system becomes more bandwidth efficient. On the other har 
examination of Figures 8.46, 8.51, and 8.61 shows that in all these systems, at a giv 
€z/No, increasing M increases the error probability and thus deteriorates the performan: 
In other words, in these systems, increasing M increases the bandwidth efficiency a 
decreases the power efficiency. This is a direct consequence of the fact that, in these s: 
tems, the dimensionality of the signal space N is one (for PAM) or two (for PSK and QA] 
and is independent of M. 

Orthogonal signals have completely different bandwidth requirements. For examp 
if we employ PPM signals, the symbol interval T is subdivided into M subintervals 
duration T/M and pulses of width T/M are transmitted in the corresponding subinterv: 
Consequently, the channel bandwidth required to transmit the PPM signals is . 
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W = M/2T = M/2(k/Ro) = MR»p/21log, M Hz. (9.7.4) 


An identical result is obtained if the M orthogonal signals are constructed as M-ary FSK 
with a minimum frequency separation of 1/2T for orthogonality. Biorthogonal and simplex 
signals result in similar relationships as PPM (orthogonal). In the case of biorthogonal sig- 
nals, the required bandwidth is one-half of that for orthogonal signals. From the bandwidth 
relation for orthogonal signals, we can see that for a fixed Rp, increasing M increases 
the bandwidth proportional to M/(2log, M). This shows that, in this case, increasing 
M decreases the bandwidth efficiency of the system. On the other hand, examination of 
Figures 9.3 and 9.15 shows that in these systems, for a fixed €,/No, increasing M improves 
the performance, and thus the power efficiency, of the system. It is also interesting to note 
that in orthogonal, biorthogonal, and simplex signaling schemes, the dimensionality of the 
signal space is not fixed and increases with an increasing M. 

From the foregoing discussion, it is clear that the characteristics of PAM, PSK, and 
QAM are completely different from the characteristics of orthogonal, biorthogonal, and 
simplex schemes. Therefore, their applications are also quite different. 

In general, it can be shown that the minimum bandwidth requirement for any digital 
modulation scheme is given by 


R,N 


W =——__, (9.7.5) 
2log, M 


where Rp is transmission rate in bits/sec, N is the dimensionality of the signal space, 
and M is the number of points in signal constellation. Using this general equation, the 
minumum transmission bandwidth required for different communication schemes is given 
in Equation (9.7.6). 


PAM: W= aoe 
2log, M 
Rp 
MPSK: W = ——, for M > 2, 
log, M 
R 
BPSK: W =~", 
2log, M 
QAM: w=—2 9.7.6 
NS Tae af (9.7.6) 
MR 
Orthogonal Signaling: W = eS, 
2 log, M 
Biotthoponat Siemans WS 
iorthogonal Signaling: W = Flog, M’ 
Siping Wee 
implex Signaling: W = ' 
P 8 E 2log, M 


A compact and meaningful comparison of these modulation methods is based on 
the normalized data rate (also called the spectral bit rate) r = R/W (bits per second 
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per hertz of bandwidth) versus the SNR/bit (€,/No) required to achieve a given erro 
probability. We can use Equation (9.7.6) to derive expressions for the spectral bit rate j; 
different signaling schemes: 


_ PAM: r = 2log, M, 
‘MPSK: r = log, M, for M > 2, 
BPSK: r = 2log, M, 
QAM: r = log, M, 


2log, M (9.7.7 
Orthogonal Signaling: r = m 
4log, M 
Biorthogonal Signaling: r = E : 
: ET 2log, M 
implex Signaling: r aA 


Figure 9.29 illustrates the graph of r = R,/W (measure of bandwidth efficiency) versu 
(G,/No) (measure of power efficiency) for PAM, QAM, PSK, and orthogonal signals fc 
the case in which the symbol error probability is Py = 1075. As discussed earlier i 
the case of PAM, QAM, and PSK, increasing the number of signal points M results in 
higher bit-rate-to-bandwidth ratio r = R»/ W. However, the cost of achieving the high 
data rate is an increase in the SNR/bit. Consequently, M-ary PAM, QAM, and PSK a 
appropriate for communication channels that are bandwidth limited, where we desire a bi 
rate-to-bandwidth ratior > 1, and where there is sufficiently high SNR to support multip! 
signal amplitudes and phases. Telephone channels are examples of such bandlimited char 
nels. The curve denoted by “Shannon limit” illustrates the boundary between the regio 
where reliable communication is possible and the region in which reliable communicatic 
is impossible (the shaded area). This curve is given by the relation 


Sp 27-1 


— = : 9.7.4 
No r ( 





This relation is derived in Chapter 12 [see Equation (12.6.5)]. 

We have already observed that the cost of doubling the number of phases (increasir 
the number of bits/symbol by one bit) in PSK approaches 6 dB (a factor of 4) in addition 
transmitted power for large M. A similar comparison for QAM indicates that the increa: 
in transmitted power is approximately 3 dB per additional bit/symbol. Table 9.2 gives tl 
factor 10 log,(M — 1)/3, which represents the increase in average power required to mai 
tain a given level of performance for QAM as the number of signal points in the rectangul 
constellation increases. Thus, we observe that QAM is preferable to PSK (and PAM) f 
large signal constellation sizes. 

In contrast to PAM, QAM, and PSK, M-ary orthogonal signals yield a bit-rate-t 
bandwidth ratio of r < 1. As M increases, r decreases due to an increase in the requir 
channel bandwidth. However, the SNR/bit required to achieve a given error probability ( 
this case, Py = 1075) decreases as M increases. Consequently, M-ary orthogonal signa 
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Figure 9.29 Comparison of several modulation methods at 10-5 symbol rate. 


TABLE 9.2 QAM SIGNAL CONSTELLATIONS 


Number of signal Increase in average 
points M power (dB) relative to M = 2 
4 3 

8 6.7 

16 10.0 

32 13.2 

64 16.2 


128 19.2 
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as well as biorthogonal and simplex signals, are appropriate for power-limited channel 
that have a sufficiently large bandwidth to accommodate a large number of signals. In thi: 
case, as M -—> œ, the error probability can be made as small as desired, provided tha 
€p/No > 0.693 ~ (—1.6dB). This is the minimum SNR/bit required to achieve reliabk 
transmission in the limit as the channel bandwidth W —> oo and the corresponding pit 
rate-to-bandwidth ratio r — 0. 


9.8 SUMMARY AND FURTHER READING 


In this chapter, we extended our coverage of M-ary digital modulation to multidimensiona 
signals and derived their performance in additive white Gaussian noise (AWGN) channels 
The multidimensional signals we considered are orthogonal, biorthogonal, simplex, anı 
binary-coded signals. The dimensionality of M-ary orthogonal, biorthogonal, and sim 
plex signals was shown to be N = M. In contrast, the M-ary binary-coded signals ar 
N-dimensional vectors, where N > M. 

We also described M-ary frequency-shift keying (FSK) signals and demonstrate: 
that when adjacent carrier frequencies are separated by Af = 1/T, where T is the symbc 
interval, the FSK signals have a geometric representation as M orthogonal vectors. W 
observed the difficulty and impracticality of implementing an M-ary FSK demodulato 
that must estimate M carrier phases required to perform phase-coherent demodulatior 
Instead , we demonstrated that the demodulator may ignore the multiple carrier phases b 
computing the envelopes of the received signals and passing the envelopes to the detecto: 
We also derived the probability of error for the FSK detector whose inputs are the envelope 
of the received signal-plus-noise for the M-ary FSK signals. 

Furthermore, we considered modulation systems with memory, emphasizing a spe 
cial type of M-ary FSK in which the phase of the transmitted signal carrier is constrained t 
be continuous as we switch from one carrier frequency to another at the end of each symbc 
interval. This is called continuous-phase FSK (CPFSK). The major reason for imposing th 
phase-continuity constraint is to reduce the sidelobes in the spectral characteristics of th 
FSK signal and, thus, reducing the transmitted signal bandwidth. A further generalizatio 
of CPFSK, called continuous-phase modulation (CPM), is obtained by allowing the basi 
pulse shape used in CPFSK to be different than rectangular. 

CPFSK and CPM are especially suitable for wireless digital communications becaus 
these digital modulations are bandwidth efficient and have a constant envelope. Therefor: 
high-efficiency nonlinear power amplifiers can be used in the transmission of the signal: 
CPFSK and CPM have been treated extensively in the technical journals and in textbook: 
A thorough treatment of CPM can be found in the book by Anderson et al. (1986). Th 
journal papers by Aulin and Sundberg (1981, 1982a, 1982b, 1984) and by Aulin et a 
(1981) provide a detailed analysis of the performance characteristics of CPM. The tutori: 
paper by Sundberg (1986) gives a very readable overview of CPM, its demodulation, an 
its performance characteristics. This paper also contains a comprehensive list of reference 

In the final section of this chapter, we compared the different types of on 
dimensional and multidimensional signals introduced in the previous chapter and th 
chapter. By fixing the probability of error for these different signals to some desired valu 


Problems i 533 


e.g., P, = 1075, we compared the different digital modulations on the basis of their bit 
rate-to-bandwidth ration, R,/W (bits/second/Hz) and the corresponding €,/No (SNR/bit) 
required to attain the selected probability of error. We observed that, in the case of PAM, 
PSK, DPSK, and QAM, as the number of signals M increases, R/W increases and the 
SNR/bit also increases. As a consequence, these digital modulation methods are suitable 
for achieving high data rates (R,/W > 1) provided that the SNR/bit can be increased 
accordingly. In contrast, M-dimensional signal waveforms with M > 4 have the charac- 
teristics that R,/W < 1 as M increases but the required SNR/bit decreases. Therefore, the 
M-ary multidimensional signals result in an expansion of the bandwidth required for their 
transmission, but this cost in channel bandwidth allows us to transmit the digital signals at 
a lower SNR/bit. As M approaches infinity, the limiting value of the SNR/bit is —1.6 dB. 
This is the lowest possible value of the SNR/bit that we can have in a digital communica- 
tion system and still maintain reliable communications. This limiting value of the SNR/bit 
is called the Shannon limit. 





PROBLEMS 


9.1 The lowpass equivalent signal waveforms for three signal sets are shown in 
Figure P-9.1. Each set may be used to transmit one of four equally probable mes- 
sages over an additive white Gaussian noise channel with a noise power spectral 
density Mo 


1. Classify the signal waveforms in Set I, Set II, and Set III. In other words, state 
the category or class to which each signal set belongs. 
2. What is the average transmitted energy for each signal set? 


3. For signal Set I, specify the average probability of error if the signals are detected 
coherently. 


4. For signal Set II, give a union bound on the probability of a symbol error if the 
detection is performed (a) coherently and (b) noncoherently. 


5. Is it possible to use noncoherent detection on signal Set III? Explain. 


6. Which signal set or signal sets would you select if you wanted to achieve a 
bit-rate-to-bandwidth (#2) ratio of at least 2? Explain your answer. 


9.2 Show that the correlation coefficient of two adjacent signal points corresponding to 
the vertices of an N-dimensional hypercube with its center at the origin is given by 


_N-2 
Y= N ’ 


and their Euclidean distance is 
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Figure P-9.1 
9.3 Consider a set of M orthogonal signal waveforms Sm (t), 1 <m < M,and0 <t < 


T, all of which have the same energy €. Define a new set of M waveforms as 


m 


Sn (t) =Sm(t)- Fe), 1s 
0<t 


<M 
<T 


Show that the M signal waveform A «| have equal energy, given by 


€ =(M-1)%/M, 


and are equally correlated, with correlation coefficient 


-if nOs, Qdt =—-—— 
Ymn = o Sn Eo M-1° 
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9.4 Consider a biorthogonal signal set with M = 8 signal points. Determine a union 
bound for the probability of a symbol error as a function of €,/No. The signal points 
are equally likely a priori. 


9.5 Consider an M-ary digital communication system where M = 2%, and N is the 
dimension of the signal space. Suppose that the M signal vectors lie on the vertices 
of a hypercube that is centered at the origin, as illustrated in Figure 9.9. Determine 
the average probability of a symbol error as a function of @,/No, where @, is the 
energy per symbol, No/2 is the power spectral density of the AWGN, and all signal 
points are equally probable. 


9.6 Consider the signal waveform 


s(t) =} cip (t - aT), 


k=1 


where p (t) is a rectangular pulse of unit amplitude and duration T,. The {c;} may 
be viewed as a code vector c = [c1, C2, ..., Cn], where the elements c; = +1. Show 
that the filter matched to the waveform s(t) may be realized as a cascade of a filter 
matched to p(t) followed by a discrete-time filter matched to the vector c. Determine 
the value of the output of the matched filter at the sampling instant t = nT,. 


9.7 A Hadamard matrix is defined as a matrix whose elements are +1 and row vectors 
are pairwise orthogonal. In the case when n is a power of 2, an n x n Hadamard 
matrix is constructed by means of the recursion 


_}1 i _| A, A, 
aaral aa 
1. Let c; denote the ith row of an n x n Hadamard matrix as previously defined. 


Show that the waveforms constructed as 


n 


si) =J capt- kT), i=1,2...;n 
k=1 


are orthogonal, where p (t) is an arbitrary pulse confined to the time interval 
0<t<T,. 


2. Show thatthe matched filters (or cross correlators) for the n waveforms {s; (t)} 
can be realized by a single filter (or correlator) matched to the pulse p (t) fol- 
lowed by a set of n cross correlators using the code words {c;}. 


9.8 The discrete sequence 
te = Vbcce +nk, k=1,2,...,n 


represents the output sequence of samples from a demodulator, where cge = +1 
are elements of one of two possible code words, c = [1,1,..., 1] and c2 = 
[1,1,...,1,—-1,...,—1]. The code word c2 has w elements, which are +1, and 
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9.9 


9.10 





9.11 




















9.12 
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n — w elements, which are —1, where w is some positive integer. The noise sequence 
2 


{ną} is white Gaussian with variance o^. 
1. What is the optimum maximum-likelihood detector for the two possible trans- 
mitted signals? 


2. Determine the probability error as a function of the parameters (c?, Sp, w). 


3. What is the value of w that minimizes the error probability? 


In Section 9.5.1, we showed that the minimum frequency separation for the orthog- 
onality of binary FSK signals with coherent detection is Af = F- However, a 
lower error probability is possible with coherent detection of FSK if Af is increased 
beyond F- Show that the minimum value of the correlation occurs at Af = Lus, 
determine the probability of error for this choice of Af. 


Consider the phase-coherent demodulator for M-ary FSK signals as shown in 
Figure 9.11. 


1. Assume that the signal 


28; 
uo(t) = zp 008 2x fet, 0<t<T 


was transmitted; determine the output of the we — 1 correlators att = T that 
corresponds to the signals um(t),m = 1, 2, . — 1, when dm 4 om. 


2. Show that the minimum frequency separation so for the signal orthogo- 
nality at the demodulator when bm Æ Ọm is Af = + 


In the demodulation and noncoherent detection of M-ary FSK signals, as illustrated 
in Figure 9.12, show that the 2M noise samples that are given in Equations (9.5.8) 
and (9.5.9) are zero-mean, mutually independent Gaussian random variables with an 
equal variance o? = =2 


In on--off keying of a carrier modulated signal, the two possible signals are 
So (t) = 0, 0<t<T, 
sı (t) = 2, cos2ufet, O<t<T, 


The corresponding received signals are 
r(t)=n(t), O<t<h 
r(t)= J2 b cos (2na fet +o) +n (t), O<t<T, 
where ¢ is the carrier phase and n (t) is AWGN. 


1. Sketch a block diagram of the receiver (demodulator and detector) that employs 
noncoherent (envelope) detection. 








| 
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2. Determine the probability density functions for the two possible decision vari- 
ables at the detector corresponding to the two possible received signals. 


3. Derive the probability of error for the detector. 


9.13 Digital information is to be transmitted by carrier modulation through an additive 
Gaussian noise channel with a bandwidth of 100 kHz and No = 107!° W/Hz. Deter- 
mine the maximum rate that can be transmitted through the channel for four-phase 
PSK, binary FSK, and four-frequency orthogonal FSK that is detected noncoher- 
ently. 


9.14 In an MSK signal, the initial state for the phase is either 0 or x radians. Determine 
the terminal phase state for the following four pairs of input data: (a) 00, (b) 01, (c) 
10, and (d) 11. 


9.15 A continuous-phase FSK signal with h = 1/2 is represented as 


s(t) = +J cos (= ) cos2n fet + ERTE AL) sin 27 fet , 


0<t<2T, 


where the + signs depend on the information bits transmitted. 


1. Show that this signal has a constant amplitude. 
2. Sketch a block diagram of the modulator for synthesizing the signal. 


3. Sketch a block diagram of the demodulator and detector for recovering the infor- 
mation. 


9.16 Sketch the phase tree, the state trellis, andthe state diagram for partial-response CPM 
with h = } and 
1 
= iF: 0<t<2T 
0, otherwise 
9.17 Determine the number of terminal phase states in the state trellis diagram for (a) a 
full-response binary CPM (CPFSK) with either h = Z or 3 and (b) a partial-response 


L = 3 binary CPM with either h = 2 or 3. 


COMPUTER PROBLEMS 


9.1 Simulation of Detector Performance for M = 4 Orthogonal Signals 


The purpose of this problem is to estimate and plot the probability of error as a func- 
tion of the SNR for a digital communication system thatemploys M = 4 orthogonal 
signals and transmits them over an additive white Gaussian noise channel. The model 
of the system to be simulated is shown in Figure CP-9.1. 
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Figure CP-9.1 Simulation model for the detection of M = 4 orthogonal signals. 


As shown, we simulate the generation of the random variables yo, y1, y2, Y3, whic 
constitute the input to the detector. First, we may generate a binary sequence < 
zeros and ones that occur with equal probability and are mutually statistically ind: 
pendent. The binary sequence is grouped into pairs of bits, which are mapped in: 
the corresponding signal components. As an alternative to generating the individu 
bits, we may generate pairs of bits, as described in CP-8.5. In either case, we ha‘ 
the mapping of the four symbols into the signal points. 


00 > so = (V%,, 0, 0, 0) ; 

= (0, V;, 0, 0) ; 
10 —> s2 = (0,0, V@;, 0); 
11 > s3 = (0, 0, 0, /%,). 


The additive noise components no, nı, n2, n3 are generated by means of four Gau 
sian noise generators, each having zero mean and variance o? = No /2. For conv 
nience, we may normalize s = 1 and vary a”. Since s, = 2%,, it follows tk 
Eb = 1/2. The detector input consists of the received signal vector y = s; + n, i 
0, 1, 2, 3. The detector computes the correlation metrics y-s;, j = 0, 1, 2, 3, a 
detects the signal having the largest correlation value. An error counter is used 
count the number of bit errors. 


0l > s 
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9.2 


9. 


W 


9.4 


Perform the simulation for the transmission of 10,000 symbols (20,000 bits) at sev- 
eral different values of SNR/bit that covers the range 0 < 10 log€,/No < 8 dB. Plot 
the estimated probability of a bit error and compare the estimated values with the 
theoretical expression for the probability of a bit error. Comment on the results. 


Simulation of Detector Performance for M = 8 Orthogonal Signals 


Modify the simulation described in CP-9.1 to transmit M = 8 orthogonal signals. 
Perform the simulation and plot the results. 


Correlation of Biorthogonal Signal Waveforms 


A set of M = 4 biorthogonal signal waveforms is shown in Figure 9.5. Note that 
S2(t) = —so(t) and s3(t) = —sı (t). Therefore, only two correlators are needed at the 
receiver to process the received signal, one for correlating r (t) with sı (t) and one for 
correlating r (t) with so(t). 


Suppose the received signal r(t) is sampled at a rate of Fs = 40/T and the correla- 
tions at the receiver are performed numerically, that is, 


k 
WET) =} rT), k = 1,2,...,20 
n=1 
k 
NT) = Do raTa NT), k = 21, 22,...,40 
n=21 


Compute and plot yo(k7;) and yı(kTs) when (a) so(t) is transmitted, (b) s,(¢) is 
transmitted, (c) —so(t) = s2(t) is transmitted, and (d) —sı (t) = s3(t) is transmitted 
and the additive noise is zero-mean, white, and Gaussian and the variance of the 
noise samples is o? = 0, o? = 0.1, and o? = 1. 


Simulation of Detector Performance for M = 4 Biorthogonal Signals 


The purpose of this problem is to estimate and plot the probability of error as a func- 
tion of the SNR for a digital communication system that employs M = 4 biorthog- 
onal signals and transmits them over an additive white Gaussian noise channel. The 
model for the system to be simulated is shown in Figure CP-9.4. 


As shown, we simulate the generation of the random variables yo and yı, which 
constitute the input to the detector. We begin by generating a binary sequence of zeros 
and ones that occur with equal probability and are mutually statistically independent, 
as described in CP-8.1. The binary sequence is grouped into pairs of bits, which are 
mapped into the corresponding signals as follows: 


00 + so = (V&;, 0) ; 
0l > sı = (0, V,); 
10 > s2 = (0, -V@,); 

(-V%;,0). 


11 > s3 = 
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Figure CP-9.4 Simulation model for the detection of M = 4 biorthogonal signals. 


Since s2 = —sı and s3 = —So, the demodulation requires two correlators or twc 
matched filters, whose outputs are yo and yı. The additive noise components no anc 
nı are generated by means of two Gaussian noise generators, each having mean zerc 
and variance ø? = No/2. For convenience, we may normalize the symbol energy tc 
s = 1 and vary the noise variance o?. Since €, = 2%,, it follows that €, = 1/2 
The detector output is compared with the transmitted sequence of bits and an erro 
counter is used to count the number of bit errors and the number of symbol errors. 


Perform the simulation for the transmission of 10,000 symbols (20,000 bits) at sev- 
eral different values of SNR/bit that covers the range 0 < 10logiy @,/No < 8 dB 
Plot the estimated probability of a symbol error and compare the estimated values 
with the theoretical expression for the probability of a symbol error. Comment or 
the results. 


Noncoherent Demodulation of Binary FSK Signals 


The objective of this problem is to digitally implement a correlation-type demodu. 
lator in a digital communication system that employs binary FSK signal waveforms 
which are given as 

u(t) =cos2nfit, O<t < Tp, 

u(t) = cos 2n fat, 0 <t < Tp, 


where fı = 1000/T, and f2 = fı + 1/Tb. The channel is assumed to impart a phas 
shift @ = 45° on each of the transmitted signals. Consequently, the received signa 
in the absence of noise is 


r(t) = cos(2n fit + x/4), i =1,2...,0 <t < T». 


The correlation-type demodulator for u1 (t) and u2(t) is to be implemented digitally 


Suppose we sample the received signal r(t) at a rate F, = 5000/T, within the 
bit interval 0 < t < Ty. Thus, the received signal r(t) is represented by the 500( 
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samples {r(n/F;)}. The correlation demodulator multiplies {r (n/F,)} by the sam- 
pled versions of u,(t) = cos2n fit, v(t) = sin2nfit, u2(t) = cos2n fot, and 
v2(t) = sin 27 fot. Thus, the correlator outputs are 


Yık) = ne (z)m (4). &=1,2,...,5000; 


yis(k) = a (£)u (2). «=1,2,...,5000; 
0 


n=! 


yae(k) = = (A) m (4); k=1,2,...,5000; 


Yas(k) = a (z)u(k), k=1,2,...,5000. 


The detector is a square-law detector that computes the two decision variables 


yı = y?,(5000) + y? (5000), 
Y2 = y3,(5000) + y2 (5000) 


and selects the information bit corresponding to the larger decision variable. 


Write a program that implements the correlation demodulator for the binary FSK 
signal processing. Assuming that r (t) = cos 21 fit, plot the four sequences {y1c(k), 
Yis(k), Y2c(k), Y2s(k)} for O < k < 5000. Repeat the computation when r(t) = 
cos 2v fat. What are the values of yı and y2 in each of the two experiments? 


9.6 Simulation of Noncoherent Detection of Binary FSK 


The objective of this problem is to estimate the probability of error for a commu- 
nication system that employs binary FSK modulation. The binary FSK waveforms 
are 


u(t) = cos2nr fit, 0<t< Th 

m(t) = cos2n(fit $) t 0<t< T) 
The block diagram of the binary FSK system to be simulated is shown in Figure 
CP-9.6. 


Since the signals are orthogonal, when u(t) is transmitted, the first demodulator 
output is 

Vie = VE, cosh + nic» 

Yis = VÊ sin d + nis 


and the second demodulator output is 


Yx = Mc, 


Y2s = Ms, 
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Figure CP-9.6 Simulation model for the noncoherent detection of binary FSK signals. 


where nic, Nis, N2c, and Ns are mutual statistically independent, zero-mean Gaus- 
sian random variables with equal variance o? and ¢ represents the channel phase 
shift. The phase shift @ may be set to zero for convenience. The square-law detector 
computes 

yw=y+yis 

Y2 = Yr + V5, 


and selects the information bit corresponding to the larger of these two decision 
variables. An error counter measures the error rate by comparing the transmitted 
sequence to the output of the detector. 


Perform the simulation of the binary FSK system as described for 10,000 bits for 
the range of the 0 < 10log,)€@,/No < 12 dB and plot the estimated probability of 
error. Also plot the theoretical bit-error probability of binary FSK and compare the 
simulated performance to the theoretical error probability. Comment on the results. 
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In the last two chapters, we considered digital communication over an additive white 
Gaussian noise (AWGN) channel and evaluated the probability of error performance of 
the optimum receiver for several different types of digital modulation techniques. In this 
chapter, we treat digital communication over a channel that is modeled as a linear filter with 
a bandwidth limitation. The bandlimited channels most frequently encountered in practice 
are telephone channels, microwave line-of-sight (LOS) radio channels, satellite channels, 
and underwater acoustic channels. 

In general, a linear filter channel imposes more stringent requirements on the design 
of modulation signals. Specifically, the transmitted signals must be designed to satisfy the 
bandwidth constraint imposed by the channel. The bandwidth constraint generally pre- 
cludes the use of rectangular pulses at the output of the modulator. Instead, the transmitted 
signals must be shaped to restrict their bandwidth to that available on the channel. The 
design of bandlimited signals is one of the topics treated in this chapter. 

We will see that a linear filter channel distorts the transmitted signal. The channel 
distortion results in intersymbol interference (ISI) at the output of the demodulator and 
leads to an increase in the probability of error at the detector. Devices or methods for 
correcting or undoing the channel distortion, called channel equalizers, are then described. 





a ce asd 


10.1 CHARACTERIZATION OF BANDLIMITED CHANNELS AND SIGNAL DISTORTION 


A bandlimited channel such as a telephone wireline is characterized as a linear filter with 
impulse response c(t) and frequency response C(f), where 


Cif) = ee c(t)e22™F* dt. (10.1.1) 


If the channel is a baseband channel that is bandlimited to Be Hz, then C(f) = 0 for 
|f| > Be. Any frequency components at the input to the channel that are higher than Be 
Hz will not be passed by the channel. For this reason, we consider the design of signals 
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ICCP) 


o(f) 


Figure 10.1 Magnitude and phase responses 
of bandlimited channel. 


for transmission through the channel that are bandlimited to W = Be Hz, as shown in 
Figure 10.1. Henceforth, W will denote the bandwidth limitation of the signal and the 
channel. i 

Now, suppose that the input to a bandlimited channel is a signal waveform gry (t), 
where the subscript T denotes that the signal waveform is the output of the transmitter. 
Then, the response of the channel is the convolution of gr(t) with c(t), i.e., 


oe} 
h(t) -=f c(t)gr(t — t)dt = c(t) * gr (t), (10.1.2) 
— 0 
or, when expressed in the frequency domain, we have 


A(f) =C(f)Gr(f), (10.1.3) 


where Gr(f) is the spectrum (Fourier transform) of the signal gr(t) and H(f) is the 
spectrum of h(t). Thus, the channel alters or distorts the transmitted signal g7(t). 

Let us assume that the signal at the output of the channel is corrupted by AWGN. 
Then, the signal at the input to the demodulator is of the form A(t) + n(t), where n(t) 
denotes the AWGN. The linear filter channel model is shown in Figure 10.2. 

From Chapter 8, we recall that in the presence of AWGN, a demodulator that employs 
a filter that is matched to the signal h(t) maximizes the signal-to-noise ratio (SNR) at its 
output. Therefore, let us pass the received signal r(t) = h(t) + n(t) through a filter that 
has a frequency response 


Be ER OM Mee ee 
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Figure 10.2 Linear filter model for a bandlimited channel. 


Gr(f) = H*(fye "4%, (10.1.4) 


where to is some nominal time delay at which we sample the filter output. The subscript R 
denotes that the matched filter is at the receiver. 

The signal component at the output of the matched filter at the sampling instant 
t = to is 


s= f IH(f)Pdf = Bn, (10.1.5) 


which is the energy in the channel output waveform h(t). The noise component at the 
output of the matched filter has a zero mean and a power spectral density 








N 
Saf) = FEAP. (10.1.6) 
Hence, the noise power at the output of the matched filter has a variance 
i No [7 No 
a! spat == | |H(f) df = > (10.1.7) 
saD —00 
Then the SNR at the output of the matched filter is 
S Gi, 2% 
(Š) E EA ee (10.1.8) 
NJo No€n/2 No 


This is the result for the SNR at the output of the matched filter that was obtained in 
Chapter 8, except the received signal energy é, has replaced the transmitted signal energy 
s. Compared to the previous result, the major difference in this development is that the 
filter impulse response is matched to the received signal h(t) instead of the transmitted 
signal. Note that the implementation of the matched filter at the receiver requires that h(t) 
or, equivalently, the channel impulse response c(t) must be known to the receiver. 


Example 10.1.1 
The signal pulse 
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Figure 10.3 Thesignal pulse in (b) is transmitted through the ideal bandlimited channel shown in (a). 
The spectrum of g7(t) is shown in (c). 
is transmitted through a baseband channel with a frequency-response characteristic as show: 


in Figure 10.3(a). The signal pulse is illustrated in Figure 10.3(b). The channel output is cor 
rupted by AWGN with the power spectral density No/2. Determine the matched filter to th 
received signal and the output SNR. 


Solution This problem is most easily solved in the frequency domain. First, the spectrum c 
the transmitted signal pulse is 


Ff sin fT) -jagr 
Gr) => FTA PTD 
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The spectrum |G7(f)|? is shown in Figure 10.3(c). Hence, 
H(f) =C(f)Gr(f) 
eae 


0, otherwise ` 


Then, the signal component at the output of the filter matched to H (f) is 


Ww 
E = | Iornrar 





w : 2 
1 i. _ Ginn fT) df 


~ Qn)? Jw PA- PTY 
= T f WT, sin? Ta 
= O? Jwr a2 a A 


The variance of the noise component is 


No f” Ne 
=] IGH Paf = 2. 
2 J_w 2 


S\ _ 26; 

N) M` 
In this example, we observe that the signal at the input to the channel is not bandlimited. 
Hence, only a part of the transmitted signal energy is received, i.e., only the signal energy that 
falls within the passband |f| < W ofthe channel. The amount of signal energy at the output 


of the matched filter depends on the value of the channel bandwidth W when the signal pulse 
duration is fixed (see Problem 10.1). The maximum value of n, obtained as W —> 00, is 





Hence, the output SNR is 


o0 T 
max, = f iGrnr = f e7(t)dt =8;, 


where @, is the energy of the signal pulse g7(t). B 


In this development, we considered the transmission and reception of only a single 
isolated signal waveform gr (t) through a bandlimited channel with the impulse response 
c(t). We observed that the performance of the system is determined by +, the energy in the 
received signal h(t). To maximize the received SNR, we must make sure that the spectrum 
of the transmitted signal waveform gy (t) is limited to the bandwidth of the channel. The 
impact of the channel bandwidth limitation is felt when we consider the transmission of a 
sequence of signal waveforms. This problem is treated in the next section. 


10.1.1 Intersymbol Interference in Signal Transmission 
Let us consider the baseband pulse amplitude modulation (PAM) communication system 


illustrated by the functional block diagram in Figure 10.4. The system consists of a trans- 
mitting filter having an impulse response gr(t), the linear filter channel with AWGN, a 
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Figure 10.4 Block diagram of a digital PAM system. 


receiving filter with an impulse response gr(t), a sampler that periodically samples the 
output of the receiving filter, and a symbol detector. The sampler requires the extraction 
of a timing signal from the received signal as described in Section 8.9. This timing signal 
serves as a clock that specifies the appropriate time instants for sampling the output of the 
receiving filter. 

Let us consider digital communication by means of M-ary PAM. Hence, the inpu 
binary data sequence is subdivided into k-bit symbols, and each symbol is mapped intc 
a corresponding amplitude level that amplitude modulates the output of the transmitting 
filter. The baseband signal at the output of the transmitting filter (the input to the channel’ 
may be expressed as 


v(t) = J. angr(t — nT), (10.1.9 


n=—00 


where T = k/R» is the symbol interval (1/T = R,/k isthe symbol rate), R, is the bit rate 
and {a,} is a sequence of amplitude levels corresponding to the sequence of k-bit blocks o; 
information bits. 

The channel output, which is the received signal at the demodulator, may be expres 


sed as 
foe) 


r(t) = D a,h(t — nT) +n(t), (10.1.10 


n=—00 


where A(t) is the impulse response of the cascade of the transmitting filter and the channel 
Thus, A(t) = c(t) * gr(t), c(t) is the impulse response of the channel, and n (t) represent 
the AWGN. 

The received signal is passed through a linear receiving filter with the impuls 
response gr(t) and frequency response Gr(f). If gr(t) is matched to A(t), then its outpu 
SNR is maximum at the proper sampling instant. The output of the receiving filter may b 
expressed as 

o0 
y(t)= $ anx(t —nT) + wl), (10.1.11 


n=—00 


where x(t) = h(t) * gr(t) = gr(t) * c(t) * gr(t) and w(t) = n(t) * gr(t) denotes th 
additive noise at the output of the receiving filter. 
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To recover the information symbols {an}, the output of the receiving filter is sampled 
periodically, every T seconds. Thus, the sampler produces 


y(mT) = } > anx(mT — nT) + w(mT) (10.1.12) 


n=—00 


or, equivalently, 


00 
Yn = J AnXm—n + Wm 


n=—OO 


= Xom + Y AnXm-n + Wm, (10.1.13) 
nm 


where xm = x(mT), Wm = w(mT), and m = 0, £1, +2,..... 

The first term on the right-hand side (RHS) of Equation (10.1.13) is the desired sym- 
bol am, scaled by the gain parameter xo. When the receiving filter is matched to the received 
signal A(t), the scale factor is 


w= [Ped = [near 


—0o = 


W 
=f IGr(PPICPAdf = En, (10.1.14) 
-w 


as indicated by the development of Equations (10.1.4) and (10.1.5). The second term on 
the RHS of Equation (10.1.13) represents the effect of the other symbols at the sampling 
instant £ = mT, called the intersymbol interference. In general, ISI causes a degrada- 
tion in the performance of the digital communication system. Finally, the third term, Wm, 
which represents the additive noise, is a zero-mean Gaussian random variable with variance 
ož = NoÉn/2, previously given by Equation (10.1.7). 

By appropriately designing the transmitting and receiving filters, we can satisfy the 
condition x, = 0 for n # 0, so that the ISI term vanishes. In this case, the only term 
that can cause errors in the received digital sequence is the additive noise. The design of 
transmitting and receiving filters is considered in Section 10.3. 


10.1.2 Digital Transmission through Bandlimited Bandpass Channels 


The development given in Section 10.1.1 for baseband PAM is easily extended to car- 
rier modulation via PAM, quadrature amplitude modulation (QAM), and phase-shift key- 
ing (PSK). In a carrier amplitude-modulated signal, the baseband PAM given by v(t) in 
Equation (10.1.9) modulates the carrier, so thatthe transmitted signal u(t) is simply 


u(t) = v(t) cos 2m fet. (10.1.15) 


Thus, the baseband signal v(t) is shifted in frequency by fo. 
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A QAM signal is a bandpass signal which, in its simplest form, may be viewed ag 
two amplitude-modulated carrier signals in phase quadrature. That is, the QAM signal may 
be expressed as 


u(t) = v(t) cos 27 fet — v(t) sin 27 fet, (10.1.16) 


where 


foe) 
v(t) = D ancgr(t — nT), 
n=—00 


A (10.1.17) 
v(t) = D Ans8t(t — nT), 


n=—00 


and {anc} and {ans} are the two sequences of amplitudes carried on the two quadrature car- 
riers. A more compact mathematical representation of the baseband signal is the equivalent 
complex-valued baseband signal 


v(t) = v-(t) + jus(t) 


Yo Gre + jans)gr(t — nT) 


n=—oo 
foe} 

= SS agr(t—nT), (10.1.18 
n=—0o 


where the sequence {an = anc + jans} is now a complex-valued sequence representin; 
the signal points from the QAM signal constellation. The corresponding bandpass QAN 
signal u(t) may also be represented as 


u(t) = Re[v(t)e/?**]. (10.1.19 


In a similar manner, we can represent a digital carrier-phase-modulated signal as i 
Equation (10.1.19), where the equivalent baseband signal is 


v(t) = J. angr(t — nT) (10.1.20 


n=—oo 


and the sequence {a,} takes the value from the set of possible (phase) values {e~/2""/M 
m = 0,1,..., M — 1}. Thus, all three carrier-modulated signals, PAM, QAM, and PSI 
can be represented as in Equations (10.1.19) and (10.1.20), where the only difference is i 
the values taken by the transmitted sequence {an}. 

The signal v(t) given by Equation (10.1.20) is called the equivalent lowpass signa 
In the case of QAM and PSK, this equivalent lowpass signal is a baseband signal which i 
complex-valued because the information-bearing sequence {a,,} is complex valued. In th 
case of PAM, v(t) is a real-valued baseband signal. 
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When transmitted through the bandpass channel, the received bandpass signal may 
be represented as 


q(t) = Re [re], (10.1.21) 


where r (t) is the equivalent lowpass (baseband) signal, which may be expressed as 


r(t)= J ahlt —nT) +n) (10.1.22) 


n=—00 


and where, as in the case of baseband transmission, A(t) is the impulse response of the 
cascade of the transmitting filter and the channel; i.e., h(t) = c(t) x gr(t), where c(t) is 
the impulse response of the equivalent lowpass channel and n(t) represents the additive 
Gaussian noise expressed as an equivalent lowpass (baseband) noise. 

The received bandpass signal can be converted to a baseband signal by multiplying 
q(t) with the quadrature carrier signals cos 2x f-t and sin 2x fet and eliminating the double 
frequency terms by passing the two quadrature components through separate lowpass fil- 
ters, as shown in Figure 10.5. Each one of the lowpass filters is assumed to have an impulse 
response gr(t). Hence, we can represent the two quadrature components at the outputs of 
these lowpass filters as an equivalent complex-valued signal of the form 


œo 


y(t) = J anx(t—nT) + w(t), (10.1.23) 


n=—0 


which is identical to the form given by Equation (10.1.11) for the real baseband signal. 
Consequently, the signal design problem for bandpass signals is basically the same as that 
described in Section 10.1.1 for baseband signals. 

In Section 10.3, we consider the design of bandlimited transmitting and receiving 
filters that either eliminate ISI or control ISI. However, first we will determine the power 
spectral density of the transmitted digital signal. Thus, we will establish the relationship 
between the spectral characteristics of the transmitted signal and the channel bandwidth 
requirements. 





cos 2x ft 
LPF S 
a xen) + wÀ 
Gio: | | Dae i 
Received 
signal 
LPF “|in È a, x(t—nT)| + wt) 
G(f) pzz% d 
—sin 2x f,t 


Figure 10.5 Conversion of the bandpass received signal to baseband. 
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10.2 THE POWER SPECTRUM OF DIGITALLY MODULATED SIGNALS 


In this section we describe the power spectrum of linearly modulated digital signals, such 
as PAM, PSK and QAM. 

As shown in Section 10.1.2, the equivalent baseband transmitted signal for a digital 
PAM, PSK, or QAM signal is represented in the general form as 


v(t) = > angr(t —nT), (10.2.1 


n=—00 


where {an} is the sequence of values selected from either a PAM, QAM, or PSK signa: 

constellation corresponding to the information symbols from the source, and gr (t) is the 

impulse response of the transmitting filter. Since the information sequence {an} is random 

v(t) is a sample function of a random process V (t). In this section, we evaluate the powe: 

density spectrum of V(t). Our approach is to derive the autocorrelation function of V(t 

and then to determine its Fourier transform. The derivation is carried out in Appendix 10A 
The power spectrum of the baseband signal v(t) is expressed in the form 











1 
Sy(f) = FENG AP, (10.2.2 





where S4( f ) is the power spectrum of the information sequence {an}, defined as 
oo) 
Salf) = >> Rame ss" (10.2.3 
m=—00 


Gr (f) is the spectrum of the transmitting filter, and Ra[m] is the autocorrelation sequenc: 
of the information sequence {an}, defined as 


Ral] = E (a;an4m) « (10.2.4 














The result in Equation (10.2.2) illustrates the dependence of the power spectral den 
sity Sy (f) of the transmitted signal on (a) the spectral characteristics Gr(f) of the trans 
mitting filter and (b) the spectral characteristics Sa( f) of the information sequence {an} 
Both G;(f) and S,(f) can be designed to control the shape and form of the power spectre 
density of the transmitted signal. 

Whereas the dependence of Sy(f) on Gr(f) is easily understood, the effect c 
the autocorrelation properties of the information sequence {a,} is more subtle. First, w 
observe that for an arbitrary autocorrelation R,[m], the corresponding power spectral den 
sity Sa( f) is periodic in frequency with period 1/T. In fact, we note that Sa (f), give 
by Equation (10.2.3), has the form of an exponential Fourier series with {R,[m]} as th 
Fourier coefficients. Consequently, the autocorrelation sequence {R,[m]} is simply 


1/2T 
R,[m] =T 1 Salf) "T df. (10.2.£ 
—1/2T 
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Second, let us consider the case in which the information symbols in the sequence 
{a,} are mutually uncorrelated. Then, 


2 2 
_ fog +m, m=0 
R,[m] karan k m x 0 


a? 


(10.2.6) 


where o? = E(a?) — m? is the variance of an information symbol. By substituting for 
R,[m] into Equation (10.2.3), we obtain the power spectral density 


loo} 
Salf) =o, +m > enim? (10.2.7) 
m=—oo 
The term involving the summation on the RHS of Equation (10.2.7) is periodic with period 
1/T. It may be viewed as the exponential Fourier series of a periodic train of impulses 
where each impulse has an area 1/T (see Table 2.1). Therefore, Equation (10.2.7) can be 
expressed as 


ae eee m 

m a Bie 

Saf) = 02 + a 5(F =) (10.2.8) 
m=—0O 

Substitution of this expression into Sy(f) given by Equation (10.2.2) yields the desired 

result for the power spectral density of the transmitted signal V (t) when the sequence of 

information symbols is uncorrelated; i.e., 


Suit) = FOr + e x [cr (F)'s(r- =). (10.2.9) 


The expression for the power spectral density of the transmitted signal given by 
Equation (10.2.9) is purposely separated into two terms to emphasize the two different 
types of spectral components. The first term o2|Gr(f )/?/T is the continuous spectrum 
and its shape depends on Gr(f). The second term in Equation (10.2.9) consists of discrete 
frequency components spaced 1/T apart in frequency. Each spectral line has a power that 
is proportional to |Gr(f)|* evaluated at f = m/T. We note that the discrete frequency 
components can be eliminated by selecting the information symbol sequence {an} to have 
zero mean. This condition is usually imposed in digital modulation methods because dis- 
crete spectral lines are considered to be undesirable. To be specific, the mean ma in digital 
PAM, PSK, or QAM signals is easily forced to be zero by selecting the signal constellation 
points to be symmetrically positioned in the complex plane relative to the origin. Under 
the condition that ma = 0, we have 


2 
Sv(f) = “IGr( Al. (10.2.10) 


Thus, the system designer can control the spectral characteristics of the transmitted digi- 
tal signal by selecting the transmitter filter Gr(f). The following example illustrates the 
spectral shaping resulting from gr (t). 


Example 10.2.1 


Determine the power spectral density in Equation (10.2.10), when g7(t) is the rectangular 
pulse shown in Figure 10.6(a). 
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8r) | 





1 
T T T 


my 
mj 


d 
T 


(b) 


Figure 10.6 A rectangular pulse gy (t) and its energy density spectrum |G7 (f )|?. 


Solution The Fourier transform of gr(t) is 





Gr(f) = apt TÍT inst, 
Hence, 
2 2 (sina fT 5 
arp = cary (SEL) 
= (AT) sin? (fT). 


This spectrum is illustrated in Figure 10.6(b). We note that it contains nulls at multiples of 1/1 
in frequency and that it decays inversely as the square of the frequency variable. I 


Example 10.2.2 illustrates the spectral shaping that can be achieved by operation 
performed on the input information sequence. 


Example 10.2.2 
Consider a binary sequence {b,}, from which we form the symbols 


Qn = bn + Dnt. 


The {b,} are assumed to be uncorrelated binary valued (+1) random variables, each having 
zero mean and a unit variance. Determine the power spectral density of the transmitted signa 
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Solution The autocorrelation function of the sequence {a,} is 
Rm] = E (anan+m) 
= E((bn + bn-1)(bn+m € bn+m-1)) 


2 m=0 
={1 m=+Łl 
0, otherwise 


Hence, the power spectral density of the input sequence is 
Sa(f) =2(1 +cos2xfT) 
= 4cos xfT 


and the corresponding power spectrum for the modulated signal is, from Equation (10.2.2), 


Sn A 
S (f) = gler? cos? xfT. 


Figure 10.7 illustrates the power density spectrum S4 (f) of the input sequence, and the cor- 
responding Sy (f) when Gr(f) is the spectrum of the rectangular pulse. a 


As demonstrated in the example, the transmitted signal spectrum can be shaped by 
having a correlated sequence {an} as the input to the modulator. 


Salf) 





1 
T 


~|- 
N 
N 
N 
q 


(b) 


Figure 10.7 Power density spectra for (a) information sequence and (b) PAM modulated signal. 
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Having obtained the power spectrum of the baseband signal v(t), itis a simple matte; 
to find the power spectrum of the corresponding bandpass signal 


u(t) = Re[v(t)e/?*/"]. 


It is shown in Appendix 10A that the power spectrum of u(t) is given as 


1 
Su(f) = Z [Sv(f — fe) +Sv(f + fol- 


Hence, the power spectrum of the bandpass signal is simply determined by shifting the 
spectrum of u(t) by the carrier frequency +f, and scaling the result by 1/4. 


10.3 SIGNAL DESIGN FOR BANDLIMITED CHANNELS 


In this section, we consider the problem of designing a bandlimited transmitting filter. First 
the design will be done under the condition that there is no channel distortion. Later, wi 
consider the problem of filter design when the channel distorts the transmitted signal. Sinc: 
H(f) = C(f)Gr(f), the condition for distortion-free transmission is that the frequency 
response characteristic C (f) of the channel must have a constant magnitude and a linea 
phase over the bandwidth of the transmitted signal, i.e., 


_ | Coemi? "o, [Fl <W 
C(f) = | 0, ifl> W (10.3.1 


where W is the available channel bandwidth, tọ represents an arbitrary finite delay, whicl 
we set to zero for convenience, and Co is a constant gain factor, which we set to unity fo 
convenience. Thus, underthe condition thatthe channel is distortion free and the bandwidtl 
of gr (t) is limited to W, we have H(f) = Gr (f). Consequently, the matched filter at th 
receiver has a frequency response Gr (f) = G} (f), and its output at the periodic samplin; 
times t = mT has the form 


y(mT) = x(O)am + 5 anx(mT — nT) + w(mT), (10.3.2 
nm 
or more simply, 
Ym = X0Am + Ý, AnXm-n + Wm, (10.3.3 
nm 


where x(t) = gr(t) * gr(t) and w(t) is the output response of the matched filter to th 
input AWGN process n(t). 

The middle term on the RHS of Equation (10.3.3) represents the ISI. The amour 
of ISI and noise that is present in the received signal can be viewed on an oscilloscope 
Specifically, we may display the received signal on the vertical input with the horizont: 
sweep rate set at 1/7. The resulting oscilloscope display is called an eye pattern becaus 
of its resemblance to the human eye. Examples of two eye patterns, one for binary PAN 
and the other for quaternary (M = 4) PAM, are illustrated in Figure 10.8(a). 
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Binary Quaternary 


(a) 


Optimum sampling 
time 


Sensitivity Distortion of 
to timing zero crossings 
; i 
error 





Noise margin 


Peak distortion 
(b) 


Figure 10.8 Eye patterns. (a) Examples of eye patterns for binary and quaternary PAM and (b) the effect of 
ISI oneye opening. 


The effect of ISI is to cause the eye to close, thereby reducing the margin for additive 
noise to cause errors. Figure 10.8(b) illustrates the effect of ISI in reducing the opening of 
the eye. Note that ISI distorts the position of the zero crossings and causes a reduction in 
the eye opening. As a consequence, the system is more sensitive to a synchronization error 
and exhibits a smaller margin against additive noise. 


Example 10.3.1 . 


Consider a binary PAM system that transmits data at arate of 1/ T bits/sec through an ideal 
channel of bandwidth W. The sampled output from the matched filter at the receiver is 


Ym = Am + Q.2am-1 — 0.3€n-2 + Wm, 
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where am = +1, with equal probability. Determine the peak value of the ISI and the noise 
margin, as defined in Figure 10.8(b). 


Solution If we compare the matched filter output ym with that given by Equation (10.3.3), i 
is apparent that x9 = 1, xı = 0.2, x2 = —0.3, and x, = 0, otherwise. The peak value of the 
ISI occurs when am-1 = —d,—2, So that the ISI term will take the peak value of +0.5. Since 
Xo = 1 and am = +1, the ISI causes a 50% reduction in the eye opening at the sampling time: 
t =mT, m = 0, +1, £2,.... Hence, the noise margin is reduced by 50% to a value of 0.5 
Thus, compared to the case in which there is no ISI, a noise component that is 50% smalle 
will cause an error at the detector. 1 


Next, we consider the problem of signal design under two conditions, namely, tha 
there is no ISI at the sampling instants and that a controlled amount of ISI is allowed. 


10.3.1 Design of Bandlimited Signals for Zero ISI—The Nyquist Criterion 


Let us consider a digital communication system as previously shown in Figure 10.4. Th 
signal component x(t) = gr (t) * c(t) * gr(t) at the output of the receiving filter, expresse: 
in the frequency domain, is 


X(f) = GANCANG) 
= Gr(f)Gr(f)Coe ?*F (10.3.4 
= Gr(f)Gr(f), 


where Gr(f) and Gr(f) denote the frequency responses of the transmitter and receive 
filters and C(f) = Co exp(—j2ufto), |f| < W denotes the frequency response of th 
channel. For convenience, we set Co = 1 and to = 0. We have also seen that the output c 
the receiving filter, when sampled periodically att = mT,m = ..., —2 — 1,0,1,2... 
yields the expression given by Equation (10.3.3). In this equation, the first term on the RH 
of the equation is the desired symbol, the second term constitutes the ISI, and the third ten 
is the additive noise. 

To remove the effect of ISI, it is necessary and sufficient that x(mT — nT) = 0 fc 
n # m and x(0) + 0, where without loss of generality, we can assume x(0) = 1. Th 
means that the overall communication system has to be designed such that 


x(nT) = | n a ; (10.3.! 


In this section, we derive the necessary and sufficient condition for X ( f) so x(t) can satis! 
the preceding relation. This condition is known as the Nyquist pulse-shaping criterion ' 
Nyquist condition for zero ISI. 


Nyquist Condition for Zero ISI. A necessary and sufficient condition for x( 
to satisfy 


x(aT) = | F : a (10.3. 
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is that its Fourier transform X (f) must satisfy 


foe) 


m 
> Xx (s + =) =T. (10.3.7) 
m=—00 
Proof. In general, x(t) is the inverse Fourier transform of X (f). Hence, 
o0 
x(t) = f X(f)" af. (10.3.8) 
—00 
At the sampling instants t = nT, this relation becomes 
OS, . 
x(nT) = f X(f) df. (10.3.9) 
—00 


Let us break the integral in Equation (10.3.9) into integrals covering the finite range of 
1/T. Thus, we obtain 


oo (Qm+1)/2T l 
x(nT) = > | X (fyei27 int df 
( 


ma oo Y (2m—1)/2T 


foe} 


yet [ œ i l 
J pa 


WZ gems 
1/2T 
= f Z(fyel?™I"T df, (10.3.10) 
—1/2T 


where we have defined Z (f) by 


Z(f) = 5 x(f+5) (10.3.11) 


m>=—0O 


Obviously, Z(f) is a periodic function with period $; therefore, it can be expanded in 
terms of its Fourier series coefficients {z,,} as 


CO 
Z= YO nT, (10.3.12) 
n=—00 
where 
1 
2T 5 
Zn = rf l Z(fye PF? df. (10.3.13) 
-2T 


Comparing Equations (10.3.13) and (10.3.10), we obtain 


Zn = Tx(—nT). (10.3.14) 
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Therefore, the necessary and sufficient conditions for Equation (10.3.6) to be satisfiec 
is that 


T n=0 
Zn = | Onai? (10.3.15: 


which, when substituted into Equation (10.3.12), yields 
Z(f)=T (10.3.16 
or, equivalently, 


> X (f + =) =T. (10.3.17 


m=—00 


This concludes the proof for the condition that X (f) must satisfy to obtain zero ISI. 
Now, suppose that the channel has a bandwidth of W. Then, C(f) = 0 for |f| > W 
consequently, X (f) = 0 for |f| > W. We distinguish three cases: 


1. In this case, T < Ww or, equivalently, 4 > 2W. Since Z(f) = yo X ( f+ 


T n=—00? 
consists of nonoverlapping replicas of X(f), which are separated by + as shown i 
Figure 10.9, there is no choice for X (f) to ensure Z( f) = T in this case. Thus, ther 
is no way that we can design a system with no ISI. 


2. In this case, T = zy or, equivalently, + = 2W (the Nyquist rate). The replication 
of X(f), separated by +, are about to overlap, as shown in Figure 10.10. It is cle: 


that there exists only one X (f) that results in Z(f) = T, namely, 


zin- È x(+ 8) 


= 0 


1 
Figure 10.9 Plotof Z(f) for the case T < ay 


zn- Sxe] 


la 





1 
Figure 10.10 Plot of Z(f) for the case T = ow" 


Section 10.3 Signal Design for Bandlimited Channels $ 561 


_[TIfI<W 
X(f) = | 0, otherwise (10.3.18) 
or X(f) =TII (4). which results in 
t 
x(t) = sinc (=) . (10.3.19) 


This means that the smallest value of T for which transmission with zero ISI is 
possible is T = zy; for this value, x(t) has to be a sinc function. The difficulty 
with this choice of x(t) is that it is noncausal and therefore nonrealizable. To make 
it realizable, usually a delayed version of it, i.e., sinc (52), is used and fp is chosen 
such that for t < 0, we have sinc (52) = 0. Of course with this choice of x(t), the 
sampling time must also be shifted to mT + tọ. A second difficulty with this pulse 
shape is that its rate of convergence to zero is slow. The tails of x(t) decay as 1/t; 
consequently, a small mistiming error in sampling the output of the matched filter at 
the demodulator results in an infinite series of ISI components. Such a series is not 
absolutely summable because of the 1/t rate of decay of the pulse; hence, the sum 


of the resulting ISI does not converge. 


3. In this case, for T > We Z(f) consists of overlapping replications of X(f) sepa- 
rated by i, as shown in Figure 10.11. In this case, there exists an infinite number of 


choices for X (f), such that Z(f) =T. 


For the T > zy case, a particular pulse spectrum that has desirable spectral properties and 
has been widely used in practice is the raised cosine spectrum. The raised cosine frequency 
characteristic (see Problem 10.5) is given as 


T, 0 < |f| < A -—aæ)/2T 
Xe(f) = (7 +008 (2 (11-58), ss o, 0320) 


On 
= 
V 





zn=-Žx- A] 





1 
Figure 10.11 Plot of Z(f) for the case T > ow" 
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where « is called the roll-off factor, which takes values in the range 0 < œ < 1. The 
bandwidth occupied by the signal beyond the Nyquist frequency 7 is called the excess 
bandwidth and is usually expressed as a percentage of the Nyquist frequency. For example, 
when a = 4, the excess bandwidth is 50%; when a = 1, the excess bandwidth is 100%, 
The pulse x (t) having the raised cosine spectrum is 


_ sinnt/T cos(nat/T) 


t) A A 
SeT aT 
3 cos(nat/T) 


Note that x(t) is normalized so that x(0) = 1. Figure 10.12 illustrates the raised cosine 
spectral characteristics and the corresponding pulses for a = 0, 1/2, 1. We note that fo; 
a = 0, the pulse reduces to x(t) = sinc(t/T) and the symbol rate is 1/T = 2W. Wher 
a = 1, the symbol rate is 1/T = W. In general, the tails of x(t) decay as 1/t? fora > 0 
Consequently, a mistiming error in sampling leads to a series of intersymbol interference 
components that converges to a finite value. 

Due to the smooth characteristics of the raised cosine spectrum, it is possible t 
design practical filters for the transmitter and the receiver that approximate the overal 
desired frequency response. In the special case where the channel is ideal witl 


c(f) = (4): we have 


Xre(f) = Gr(f)Gr(f). (10.3.22 


x(t) 





(b) 


Figure 10.12 Pulses having a raised cosine spectrum. 
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In this case, if the receiver filter is matched to the transmitter filter, we have X,.(f) = 
Gr(f)Gr(f) = |Gr(f)|?. Ideally, 


Gr(f) = ViXre( fle 7" (10.3.23) 
and Gr(f) = G} (f), where to is some nominal delay that is required to assure physi- 


cal realizability of the filter. Thus, the overall raised cosine spectral characteristic is split 
evenly between the transmitting filter and the receiving filter. We should also note that an 
additional delay is necessary to ensure the physical realizability of the receiving filter. 


Example 10.3.2 
An ideal channel has the frequency-response characteristic shown in Figure 10.13. Determine 
the frequency-response characteristics Gr (f) and Gr(f) of the transmit and receiver filters, 
such that Gr(f)Gr(f) = Xie(f), where X,-(f) is the raised cosine spectral characteristic 
given by Equation (10.3.20), and the desired roll-off factor is selected to be a = 1/2. Also, 
determine the symbol rate 1/7, and compare it with the Nyquist rate. 


Solution Since the passband of the channel is limited to |f| < 1200 Hz and œ = 1/2, we 
have 
lta 3/2 
2T 2T 
Hence, the symbol rate 1/T = 1600 symbols/sec. In contrast, the Nyquist rate is 2400 sym- 
bols/sec. The frequency response X(f) is given as (with T = 1/1600), 


= 1200. 


T, 0 < |f| < 400 
Xr(f) = { $ [1 + cos (4% (IfI — 400))], 400 < IfI < 1200. 
0, Ifl = 1200 


Then, 
IGr(f)| = IGRI = V Xl f). 


The phase characteristics of Gr(f) and Gr(f) can be selected to be linear, with Og(f) = 


—6r(f). a 
i ICO] 
—1200 0 1200 f 
(ZO) 
O(t) = —2x fto 
f 


Figure 10.13 Frequency response ofideal channel 
in Example 9.2.2. 
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10.3.2 Design of Bandlimited Signals with Controlled !SI—Partial-Response Signals 


As we have observed from our discussion of signal design for zero ISI, it is necessary to 
reduce the symbol rate 1/T below the Nyquist rate of 2W symbols/sec in order to realize 
practical transmitting and receiving filters. On the other hand, suppose we choose to relax 
the condition of zero ISI and, thus, achieve a symbol transmission rate of 2W symbols/sec. 
By allowing for a controlled amount of ISI, we can achieve this symbol rate. 

We have already seen that the condition of zero ISI is x(nT) = 0 forn 4 0. However, 
suppose that we design the bandlimited signal to have controlled ISI at one time instant. 
This means that we allow one additional nonzero value in the samples {x(nT)}. The ISI 
that we introduce is deterministic or “controlled”; hence, it can be taken into account at the 
receiver. We will discuss this case next. 

One special case that leads to (approximately) physically realizable transmitting and 
receiving filters is specified by the samples! 


1, n=0,1 
x(nT) = | 0, othérwisé” (10.3.24) 
Now, using Equation (10.3.14), we obtain 
T n=0,-1 
T | 0, otherwise ’ (10.3.25) 
which, when substituted into Equation (10.3.12), yields 
Z(f) =T +T eiT., (10.3.26. 


As in the preceding section, it is impossible to satisfy this equation for T < a However 
for T = >Ł, we obtain 


Ww? 
x= lawte], <w 
0, otherwise 
1 „—jnf/2W Í 
_ [pet cos (3$), Ifi< wW (10.3.27 
0, otherwise 


Therefore, x(t) is given by 
x(t) = sinc (2Wt) + sinc(2Wt — 1). (10.3.28 


This pulse is called a duobinary signal pulse. It is illustrated, along with its magnitude spec 
trum, in Figure 10.14. We note that the spectrum decays to zero smoothly, which mean 
that physically realizable filters can be designed to approximate this spectrum very closely 
Thus, a symbol rate of 2W is achieved. 

Another special case that leads to (approximately) physically realizable transmittin 
and receiving filters is specified by the samples 


‘Tt is convenient to deal with samples of x(t) that are normalized to unity for n = 0, 1. 
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x(t) 


Ayn 


Figure 10.14 Time-domain and 
frequency-domain characteristics of 
a duobinary signal. 





l, n=-l 
x(nT)={ -1, n=1 ; (10.3.29) 
0, otherwise 


The corresponding pulse x(t) is given as 
x(t) = sinc (¢+ T)/T — sinc (t —T)/T, (10.3.30) 
and its spectrum is 


e: se [eW — in f!W] = 
= Sai 10.3.31 
(f) 0, FI> W ( ) 


This pulse and its magnitude spectrum are illustrated in Figure 10.15. It is called a modified 
duobinary signal pulse. It is interesting to note that the spectrum of this signal has a zero 
at f = 0, making it suitable for wansmission over a channel that does not pass DC. 

We can obtain other interesting and physically realizable filter characteristics by 
selecting different values for the samples {x(nT)} and by selecting more than two nonzero 
samples. However, as we select more nonzero samples, the problem of unraveling the con- 
trolled ISI becomes more cumbersome and impractical. 

In general, the class of bandlimited signals pulses that have the form 


SS, (tL) sina WOE = 0/20) 
x)= D0 aC) 2nW(t —n/2W) 
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x(t) 


Figure 10.15 Time-domain and 
frequency-domain characteristics 
of a modified duobinary signal. 





and their corresponding spectra 


1 ro) . 
L x (st) PW, F< W 
x=] 2W 2 iara ay) 


0, Ifl > W 


arecalled partial response signals when controlled ISI is purposely introduced by selecting 
two or more nonzero samples from the set {x(n/2W)}. The resulting signal pulses allow us 
to transmit information symbols at the Nyquist rate of 2W symbols/sec. The detection o! 
the received symbols in the presence of controlled ISI is described in the following section 


10.4 DETECTION OF PARTIAL-RESPONSE SIGNALS 


When the transmitter and receiver filters Gr (f) and Gr(f) are designed for zero ISI, the 
detectors for various modulation methods described in the last two chapters apply withou 
modification and their error rate performance in additive white Gaussian noise is the samı 
as previously derived in these two chapters. In this section, we consider the detection o 
data symbols in the presence of controlled ISI and evaluate the error rate performance o 
the detector. 
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10.4.1 Symbol-by-Symbol Detection 


In this section, we describe a symbol-by-symbol method for detecting the information 
symbols at the demodulator for PAM when the received signal contains controlled ISI. 
This symbol detection method is relatively easy to implement. A second method, based on 
the maximum -likelihood (ML) criterion for detecting a sequence of symbols, is described 
in Section 10.4.3. This second method minimizes the probability of error but is a little 
more complex to implement. In particular, we consider the detection of the duobinary 
and the modified duobinary partial response signals. In both cases, we assume that the 
desired spectral characteristic X (f) for the partial response signal is split evenly between 
the transmitting and receiving filters, i.e., |Gr(f)| = |Gr(f)| = IX (AI. 

For the duobinary signal pulse, x(nT) = 1, for n = 0, 1 and zero otherwise. Hence, 
the samples at the output of the receiving filter have the form 


Ym = bm + Wm = Am + Am-i + Wm, (10.4.1) 


where {a,,} is the transmitted sequence of amplitudes and {wm} is the sequence of additive 
Gaussian noise samples. Let us ignore the noise for the moment and consider the binary 
case where am = +1 with equal probability. Then, bm takes on one of three possible 
values, namely, bm = —2, 0, 2, with corresponding probabilities 1/4, 1/2, 1/4. If a,,—1 is 
the detected symbol from the signaling interval beginning at (m — 1), its effect on bm, 
the received signal in the mth signaling interval, can be eliminated by subtraction, thus 
allowing am to be detected. This process can be repeated sequentially for every received 
symbol. 

The major problem with this procedure is that errors arising from the additive noise 
tend to propagate. For example, if the detector makes an error in detecting a,,_1, its effect 
on bn is not eliminated; in fact, it is reinforced by the incorrect subtraction. Consequently, 
the detection of am is also likely to be in error. 

' Error propagation can be avoided by precoding the data at the transmitter instead of 
eliminating the controlled ISI by subtraction at the receiver. The precoding is performed 
on the binary data sequence prior to modulation. From the data sequence {d,} of ones 
and zeros that is to be transmitted, a new sequence {pn}, called the precoded sequence, is 
generated. For the duobinary signal, the precoded sequence is defined as 


Pm = dm © Pm-1, m= 1l,2,..., (10.4.2) 


where the symbol © denotes modulo-2 subtraction.” Then, we set am = — 1 if Pm = 0, and 
am = 1 if Pm = 1, i.€., am = 2Pm — 1. 
The noise-free samples at the output of the receiving filter are given as 


bm = am + am-1 
= (2Pm — 1) + (2Pm-1 — 1) (10.4.3) 
= 2(Pm + Pm-1— 1). 


? Although this is identical to modulo-2 addition, it is convenient to view the precoding operation for 
duobinary in terms of modulo-2 subtraction. In the M-ary case, modulo-M addition and subtraction are clearly 
different. 
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Consequently, 
b 
Pm + Pm-1 = F +1. (10.4.4) 


Since dm = Pm ® Pm-1, it follows that the data sequence d, is obtained from bm by using 
the relation 


bm 
dm = > +1 (mod 2). (10.4.5) 


Consequently, if bm = 2, dm = O and if bm = 0, dm = 1. The effect of precoding is clear 
from Equation (10.4.5). The received level for the mth transmission D,, is directly related 
to dm, the data at the same transmission time. Therefore, an error in reception of bm only 
affects the corresponding data dm, and no error propagation occurs. 


Example 10.4.1 
For the binary data sequence {d,} given as 


111010010001101, 


determine the precoded sequence {pn}, the transmitted sequence {an}, the received sequence 
{bn}, and the decoded sequence {dn}. 


Solution By using the Equations (10.4.2), (10.4.3), and (10.4.5), we obtain the desired 
sequences, which are given in Table 10.1. E 


In the preceding derivation, we neglected the effect of the additive noise on the detec- 
tion method. In the presence of additive noise, the sampled outputs from the receiving 
filter are given by Equation (10.4.1). In this case, yn = bm + Wm is compared with the 
two thresholds set at +1 and —1. The data sequence {d„} is obtained according to the 
detection rule 


dn = | 6 if —1<Ym <1 (10.4.6) 


The extension from binary PAM to multilevel PAM signaling using the duobinary 
pulses is straightforward. In this case, the M-level amplitude sequence {a,,} results in a 


(noise-free) sequence 


bm = Gm +am-1, m=1,2,..., (10.4.7) 


TABLE 10.1 BINARY SIGNALING WITH DUOBINARY PULSES 


Data sequence d, 1 1 1 0 1 0 0 10 0 0 1 1 0 1 
Precoded sequence Pp, 0 1 0O 1 1 0 0 O 1 1 1 1 O 1 1 0 
Transmitted sequencea, —-1 1 -1 1 1 -l -1 -1 1 1 1 1 ~-1 1 1 -l 
Received sequence b, 0 0 0 2 0 -2 -2 0 2 2 2 0 0 2 0 
Decoded sequence d,, 1 1 1 0 1 0 0 1 0 0 0 1 1 0 1 





Doke eaa ta EE a eS ik re a 
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which has 2M — 1 possible equally spaced levels. The amplitude levels are determined 
from the relation 


Am = 2Pm — (M — 1), (10.4.8) 
where {Pm} is the precoded sequence that is obtained from an M-level data sequence {dm} 
according to the relation 


Pm = dm © Pm—1 (mod M), (10.4.9) 


where the possible values of the sequence {dm} are 0, 1, 2,..., M — 1. 
In the absence of noise, the samples at the output of the receiving filter may be 
expressed as 


bm = Gn + Qm-1 


= [pm + Pm-ı — (M — 1)]. (10.4.10) 
Hence, 
Pm + Pm-1 = e +(M — 1). (10.4.11) 
Since dm = Pm + Pm-1, it follows that 
dm = "u + (M — 1) (mod M). (10.4.12) 


Here again, we see that error propagation has been prevented by using precoding. 
Example 10.4.2 
Consider the four-level data sequence {dp} 


0013120332010, 


which was obtained by mapping two bits into four-level symbols, i.e., 00 > 0, 01 > 1, 
10 — 2, and 11 — 3. Determine the precoded sequence {pq}, the transmitted sequence {a,}, 
the received sequence {b,,}, and the decoded sequence {d,,}. 


Solution By using Equations (10.4.7) through (10.4.12), we obtain the desired sequences, 
which are given in Table 10.2. a 


In the presence of noise, the received signal-plus-noise is quantized to the nearest 
possible signal level and the preceding rule is used on the quantized values to recover the 
data sequence. 


TABLE 10.2 FOUR-LEVEL TRANSMISSION WITH DUOBINARY PULSES 


Data sequence d,, 0 0 1-35 FT. .2 0 3 3 2 0 1 0 
Precoded sequence p,, 0 0 0 Tt. 2. 3° 33 1 2 i 3 2- 2 
Transmitted sequence an —3 -3 -3 -1 1 3 3 -1 1 -l1 -l1 3 1 1l 
Received sequence b, —6 -6 -4 0 4 6 2 0 0 -2 2 4 2 
Decoded sequence d,, 0 0 I 3. ©. 2 0 3 3 2 0 1 0 
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In the case of the modified duobinary pulse, the controlled ISI is specified by the 
values x(n/2W) = —1 for n = 1, x(n/2W) = 1 for n = —1, and zero otherwise. 
Consequently, the noise-free sampled output from the receiving filter is given as 


bm = am — An—2, (10.4.13) 


where the M-level sequence {an} is obtained by mapping a precoded sequence according 
to the relation Equation (10.4.8) and 


Pm = dm ® Pm—2 (mod M). . (10.4.14) 


From these relations, it is easy to show that the detection rule for receiving the data 
sequence {dm} from {bm} in the absence of noise is 


bm 
dm = (mod M). (10.4.15) 


As demonstrated, the precoding of the data at the transmitter makes it possible to 
detect the received data on a symbol-by-symbol basis without having to look back at pre- 
viously detected symbols. Thus, error propagation is avoided. 

The probability of error of the symbol-by-symbol detector previously described is 
determined in the following section. 


10.4.2 Probability of Error for Symbol-by-Symbol Detection 


In this section, we determine the probability of error for the symbol-by-symbol detection oi 
digital M-ary PAM signaling using duobinary and modified duobinary pulses. The channe. 
is assumed to be an ideal bandlimited channel with additive white Gaussian noise. The 
model for the communication system is shown in Figure 10.16. 

Atthe transmitter, the M-level data sequence {d,,} is precoded as previously describe 
The precoder output is mapped into one of M possible amplitude levels. Then the trans: 
mitting filter with frequency response G7(f) has an output i 


vt) = J. angr(t — nT). (10.4.16 


n=—00 
M-level Output 
= Precoder —>| Detector |}—-——- 


Figure 10.16 Block diagram of the modulator and demodulator for partial response signals. 
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The partial-response function X (f) is divided equally between the transmitting and receiv- 
ing filters. Hence, the receiving filter is matched to the transmitted pulse, and the cascade 
of the two filters results in the frequency characteristic 


IGr(f)Gr(f)| =X). (10.4.17) 


The matched filter output is sampled at t = nT = n/2W and the samples are fed to the 
decoder. For the duobinary signal, the output of the matched filter at the sampling instant 
may be expressed as 


Ym = An +Am—1 + Wm 


(10.4.18) 
= bn + Wm, 


where Wm is the additive noise component. Similarly, the output of the matched filter for 
the modified duobinary signal is 


Ym = Am — An—2 + Wm 


10.4.19 


For binary transmission, let a,, = +d, where 2d is the distance between signal levels. 
Then, the corresponding values of b,, are (2d, 0, —2d). For M-ary PAM signal transmis- 
sion, where am = +d,+3d,...,+(M — 1)d, the received signal levels are bm = O, 
2d, 4d, ...,=:2(M — 1)d. Hence, the number of received levels is 2M — 1. The input 
transmitted symbols {a,,} are assumed to be equally probable. Then, for duobinary and 
modified duobinary signals, it is easily demonstrated that, in the absence of noise, the 
received output levels have a (triangular) probability mass function of the form 


M —|m| 

M? ’ 
where b denotes the noise-free received level and 2d is the distance between any two 
adjacent received signal levels. l 

The channel corrupts the signal transmitted through it by the addition of white Gaus- 

sian noise with zero mean and a power spectral density No/2. We assume that a sym- 
bol error is committed whenever the magnitude of the additive noise exceeds the distance 
d. This assumption neglects the rare event that a large noise component with magnitude 
exceeding d may result in a received signal level that yields a correct symbol decision. The 
noise component w,, is zero-mean Gaussian with variance 


P(b =2md) = m = 0, +1, +2, ...+ (M — 1), (10.4.20) 


N, w 
=> / IGRA)? af 
-W 


No [Y 
=a | |XOldf = 2No/7, (10.4.21) 
-W 
where we have used Equations (10.3.27) and (10.3.31) to compute the integral. Equation 
(10.4.21) applies to both duobinary and modified duobinary signals. Hence, an upper bound 
on the symbol probability of error is 
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M-2 
Pu< >> P(ly—2md| > d|b = 2md)P(b = 2md) 
m=—(M-—2) 


+2P(y+2(M — 1)d > d|b = —2(M — 1)d)P (b = —2(M — 1)d) 


M-\ 
= P(|y| > dib = 0) E > P(b = 2md) — P(b = 0) — P(b = —2(M — Da) 





m=0 
1 
= (1 — a) P(\y| > d|b = 0). (10.4.22) 
But 
2 9 2 jn 2 
P(\y| > dlb =0) = f e™ 12w dx 
ly! V2NCy Jd 
xd? 
=2 — |]. : 10.4.2 

Q No ( 3) 


Therefore, the average probability of a symbol error is upper bounded as 


1 nd? 
2{1- — — |. 10.4.24 
Pu < ( i) Q i IN, ( f 


The scale factor d in Equation (10.4.24) can be eliminated by expressing d in terms o 
the average power transmitted into the channel. For the M-ary PAM signal in which the 
transmitted levels are equally probable, the average power at the output of the transmittin; 
filter is 





E (a?) x 2 
Py =S [ier df 





WwW 
= J IX (Pdf 
-wW 


4 2 
= — 10.4.25 
ae (a2), ( 


where E (a?) is the mean square value of the M signal levels, which is 


rage 
E (a,) = a“. (10.4.2€ 


Therefore, 


yon 3m PayT 


oh ea, (10.4.2; 
4(M2 — 1) 
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By substituting the value of d? from Equation (10.4.27) into Equation (10.4.24), we obtain 
the upper bound for the symbol error probability as 


Py <2(1-—.)o aes Cav 10.4.28 
“<i Tm 4) M-N | (Ona 


where Gay = PaT is the average energy/transmitted symbol, which can also be expressed 
in terms of the average bit energy as ay = kĉpay = (log2M) tay. 

The expression in Equation (10.4.28) for the probability of error of M-ary PAM 
holds for both a duobinary and a modified duobinary partial response signal. If we compare 
this result with the error probability of M-ary PAM with zero ISI, which can be obtained 
by using a signal pulse with a raised cosine spectrum, we note that the performance of par- 
tial response duobinary or modified duobinary has a loss of (11/4)? or 2.1 dB. This loss in 
SNR is due to the fact that the detector for the partial response signals makes decisions on 
a symbol-by-symbol basis; thus, it ignores the inherent memory contained in the received 
signal at the input to the detector. 


10.4.3 Maximum-Likelihood Sequence Detection of Partial-Response Signals 


When the received signal sequence has no memory, the symbol-by-symbol detector that 
was described in Sections 8.3.3 and 8.4.1 is optimum in the sense of minimizing the 
probability of a symbol error. On the other hand, when the received symbol sequence has 
memory, i.e., the received symbols in successive symbol-time intervals are statistically 
interdependent, the optimum detector bases its decisions on the observation of a sequence 
of received symbols over successive symbol-time intervals. 

The sequence of received symbols resulting from the transmission of partial response 
signal waveforms is characterized as a sequence having memory between successive sym- 
bols. To observethe memory in the received sequence, let us look at the noise-free received 
sequence for binary transmission given in Table 10.1. The sequence {bm} is 0, 0, 0, 2, 0, —2, 
—2,0,2,2.... We note that it is not possible to have a transition from —2 to +2 or from 
+2 to —2 in one symbol interval. For example, if the signal level at the input to the detec- 
tor is —2, the next signal level can be either —2 or 0. Similarly, if the signal level at a 
given sampling instant is 2, the signal level in the next time instant can be either 2 or 0. In 
other words, it is not possible to encounter a transition from —2 to 2 or vice versa between 
two successive received samples from the matched filter. However, a symbol-by-symbol 
detector does not exploit this constraint or inherent memory in the received sequence. 

The memory that is inherent in the received sequence {y,,} resulting from transmis- 
sion of a partial response signal waveform is conveniently represented by a trellis diagram. 
For example, the trellis for the duobinary partial response signal for binary data transmis- 
sion is illustrated in Figure 10.17. For binary modulation, this trellis contains two states; 
each state correspond to the two possible input values of am, i.€., &4m = +1. Each branch in 
the trellis is labeled by two numbers. The first number on the left is the new data bit, i.e., 
Qm+1 = +1. This number determines the transition to the new state. The number on the 
right is the received signal level. 
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t —1/-2 —1/-2 —1/-2 
f Í Figure 10.17 Trellis for duobinary partial 


t=0 t=T t=2T t=3T response signals. 


The duobinary signal has a memory of length L = 1. Hence, for binary modulation, 
the trellis has S; = 2 states. For M-ary PAM modulation, the number of trellis states is M. 

The optimum sequence detector that exploits the memory inherent in the received 
sequence of symbols may be based on the maximum a posteriori probability (MAP) crite- 
rion (see Section 8.4.1). For example, consider the transmission of a sequence of N sym- 
bols {am, m = 1, 2,..., N}. If each symbol can take one of M possible values, then there 
are MN possible transmitted sequences, denoted as a®, 1 < k < M”. The received 
sequence is {ym, 1 < m < N}, and it is denoted as yy. A detector that bases its decision 
on the MAP criterion computes a posteriori probabilities 


P(a® was transmitted |yy), 1 < k < M” (10.4.29) 


and selects the particular sequence that yields the highest probability. But the probabilities 
in Equation (10.4.29) can be expressed as 


(k) (k) 
P (a? was transmitted | yy) = f (pulak ) P (ai?) (10.4.30) 
"y a i fw) f B 


When all the symbol sequences a® are equally likely to be transmitted, which is usually 


the case in practice, selecting the sequence that maximizes the a posteriori probability is 
equivalent to finding the sequence a that maximizes the conditional probability density 
function (PDF) f ( YN las) over all possible M” sequences. Thus, the optimum detection 
criterion is the maximum-likelihood criterion. 

The optimum ML sequence detector computes M™ probabilities, which includes one 
probability for each of the M™ possible transmitted sequences. When M and N are large, 
the computational complexity of the ML detector becomes prohibitive. However, a compu- 
tationally efficient algorithm for performing ML sequence detection, invented by Andrew 
Viterbi during the late 1960s, allows us to reduce the computational burden by eliminating 
sequences as new data is received. In effect, the Viterbi algorithm (VA) is a sequential trel- 
lis search algorithm, which is described in more detail in Chapter 13 (Section 13.3.2) as a 
decoding algorithm for convolutional codes. 

In the case of the trellis shown in Figure 10.17 for the duobinary partial response 
signal, we observe that there are two states, labeled as +1 and —1, as well as nodes having 
two incoming signal paths and two outgoing signal paths. At each node of the trellis, the 
VA computes two probabilities (two metrics) corresponding to each of the two incoming 
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signal paths. One of the two paths is selected as the more probable (based on the values 
of the corresponding probabilities) and the other path is discarded. The surviving path at 
each node is then extended to two new paths, which includes one path for each of the 
two possible input symbols, and the search process continuous. Thus, the VA reduces the 
computational complexity of the ML detector. 

For the class of partial response signals, the received sequence {ym, 1 < m < N} is 
generally described statistically by thejoint PDF f(y, |an), where yy =(y1, Y2; -<-> Yn)’ 


and ay = (aj, a2, ..., ay)! and N > L. When the additive noise is zero-mean Gaussian,’ 
Ft (yn |an) is a multivariate Gaussian PDF; i.e., 
Ff (yy lan) = -20N -bn C! on bw), (10.4.31) 


(Qn) NAC)" 


where by = (bi, b2,..., by)’ is the mean of the vector yy and C is the N x N covariance 
matrix of yy. Then, the ML sequence detector selects the sequence through the trellis that 
maximizes the PDF f(y, |an). 

The computations for finding the most probable sequence through the trellis is sim- 
plified by taking the natural logarithms of f (yy |an). Thus, 


N 1 
In f(Yy |an) = -7 POT) — 5 Inidet(©)| — Ow — by)'C™' (yy — by)/2. (10.4.32) 


Given the received sequence {ym}, the data sequence {am} that maximizes In f (yy | ay) is 
identical to the sequence {am} that minimizes (yy — by)‘ C (yy — by); ice., 


ay = arg min( yy — by)' C! (yy — by). (10.4.33) 


The search through the trellis for the minimum distance path is performed sequen- 
tially by use of the Viterbi algorithm. Let us consider the duobinary signal waveform with 
binary modulation and suppose that we begin at the initial state with ag = 1. Then upon 
receiving yı = a; + ao + w at time t = T and y2 = a + q + w3 at time t = 2T, we have 
four candidate paths, corresponding to (a1, a2) = (1, 1), (—1, 1), (1, —1), and (—1, —1). 
The first two candidate paths merge at state 1 at £ = 2T. For the two paths merging 
at state 1, we compute the metrics jz2(1, 1) and “2(—1, 1) and select the more proba- 
ble path. A similar computation is performed at state —1 for the two sequences (1, —1) 
and (—1, —1). Thus, one of the two sequences at each node is saved and the other is dis- 
carded. The trellis search continues upon receipt of the signal sample y; at time t = 3T, 
by extending the two surviving paths from time t = 2T. 

The metric computations are complicated by the correlation of the noise samples at 
the output of the matched filter for the partial response signal. For example, in the case 
of the duobinary signal waveform, the correlation of the noise sequence {wm} is over two 
successive signal samples. Hence, wm and w+, are correlated for k = 1 and uncorrelated 
for k > 1. In general, a partial response signal waveform with memory L will result in a 
correlated noise sequence at the output of the matched filter, which satisfies the condition 
E(WmWm+k) =0 fork > L. 


3To simplify the notation, we omit the superscript k from the following development. 
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The ML sequence detection introduces a variable delay in detecting each transmitted 
information symbol. In practice, the variable delay is avoided by truncating the surviving 
sequences to N; most recent symbols, where N, > 5L, thus achieving a fixed delay. In case 
the M“ surviving sequences at time t = mT disagree on the symbol a,,_y,, the symbol in 
the most probable surviving sequence may be chosen. The loss in performance resulting 
from this truncation is negligible if N, > 5L. 


10.4.4 Error Probability of the Maximum-Likelihood Sequence Detector 


In general, the computation of the exact probability of error is very difficult. Instead, we 
shall determine an approximation to the probability of error, which is based on comparing 
the metrics of two paths which merge at a node and which are separated by the smallest 
Euclidean distance of all other paths. Our derivation is performed for the duobinary partial 
response signal waveform. 

Let us consider the trellis for the duobinary partial response signal shown in Figure 
10.17. We assume that we start in state 1 at t = 0 and that the first two transmitted symbols 
are a, = 1 and a = 1. Then, at t = T we receive yy = 2d + w and at t = 2T we 
receive y2 = 2d + w2. An error is made at state 1 if the path (a1, a2) = (—1, 1) is more 
probable than the path (a1, a2) = (1, 1), given the received values of y} and y2. This path 
error event is the dominant path error event and, hence, it serves as a good approximation 
to the probability of error for the ML sequence detector. 

From Equation (10.4.32), the metric for the path (a), a2) = (1, 1) is 


= — 2d 
bel, 1) = [yı —2d y2 — 2d]C7! i | ; (10.4.34) 
y — 2d 
where the covariance matrix C is given by (see Problem 10.35) 
2M% |1 4 
caan i} 1 l (10.4.35) 
x |; 1 l 
For the path (a1, a2) = (—1, 1), the corresponding metric is 
Ha(-1,1) =[y1 y] | À : | . (10.4.36) 


The probability of a path error event is simply the probability that the metric 42(—1, 1) is 
smaller than the metric 22(1, 1); i.e., 


Po = Pluo(—1, 1) < p(l, 1)]. (10.4.37) 


By substituting yı = 2d + w; and y2 = 2d + w2 into Equations (10.4.34) and (10.4.36) 
we find that 


P, = P(w, +w < —2d). (10.4.38) 
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Since w, and w2 are zero-mean (correlated) Gaussian variables, their sum is also zero- 
mean Gaussian. The variance of the sum z = w; + w2 is simply o? = 16N0o/3x. Therefore, 


| c) 
P) = P(z < —2d) = Q (=) =Q ( ikk (10.4.39) 


From Equation (10.4.27) we have (with M = 2) the expression for d? as 


wPywT np 
feo =, A. 
4 A (10.4.40) 


Hence, the probability of the path error event is 


= 1.502 (2%, 
n= ol) v (+). (10.4.41) 


First, we note that this path error event results in one bit-error in the sequence of 
two bits. Hence, the bit-error probability is P2/2. Second, there is a reduction in SNR of 
10 log(1.5m*/16) = —0.34 dB relative to the case of no intersymbol interference. This 
small SNR degradation is apparently the penalty incurred in exchange for the bandwidth 
efficiency of the partial response signal. Finally, we observe that the ML sequence detector 
has gained back 1.76 dB of the 2.1 dB degradation inherent in the symbol-by-symbol 
detector. 








10.5 SYSTEM DESIGN IN THE PRESENCE OF CHANNEL DISTORTION 


In Section 10.3.1, we described a signal design criterion that results in zero ISI at the 
output of the receiving filter. Recall that a signal pulse x(t) will satisfy the condition of 
zero ISI at the sampling instants t = nT,n = +1, +2,..., if its spectrum X (f) satisfies 
the condition given by Equation (10.3.7). From this condition, we concluded that, for ISI- 
free transmission over a channel, the transmitter—receiver filters and the channel transfer 
function must satisfy 


Gr(f)CP)Gr(f) = Xxl), (10.5.1) 


where X,,( f) denotes the Fourier transform of an appropriate raised cosine pulse, whose 
parameters depend on the channel bandwidth W and the transmission interval T. In this 
section, we are concerned with the design of a digital communication system that sup- 
presses ISI in a channel with distortion. We first present a brief coverage of various types 
of channel distortion, and then we consider the design of transmitter and receiver filters. 

We distinguish two types of distortion. Amplitude distortion results when the ampli- 
tude characteristic |C(f)| is not constant for |f| < W, as illustrated in Figure 10.18(a). 
The second type of distortion, called phase distortion, results when the phase characteristic 
6-(f) is nonlinear in frequency, as illustrated in Figure 10.18(b). 
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IC] OAF) 





(b) 


Figure 10.18 Channel characteristics illuswating (a) amplitude distortion and (b) phase distortion. 


Another view of phase distortion is obtained by considering the derivative of 0;( f). 
Thus, we define the envelope delay characteristic as we did in Problem 2.66: 


_ -146-(f) 
A= ar 


When 6,(f) is linear in f, the envelope delay is constant for all frequencies. In this case 
all frequencies in the transmitted signal pass through the channel with the same fixed time 
delay. In such a case, there is no phase distortion. However, when 6,(f) is nonlinear, the 
envelope delay t(f) varies with frequency and the various frequency components in thr 
input signal undergo different delays in passing through the channel. In such a case, wi 
say that the transmitted signal has suffered from delay distortion. 

Both amplitude and delay distortions cause intersymbol interference in the receive 
signal. For example, let us assume that we have designed a pulse with a raised cosine spec 
trum that has zero ISI at the sampling instants. An example of such a pulse is illustrated i1 
Figure 10.19(a). When the pulse is passed through a channel filter with a constant ampli 
tude |C(f)| = 1 for |f| < W and a quadratic phase characteristic (linear envelope delay’ 
the received pulse at the output of the channel is as shown in Figure 10.19(b). Note that th 
periodic zero crossings have been shifted by the delay distortion, so that the resulting puls 
suffers from ISI. Consequently, a sequence of successive pulses would be smeared into on 
another, and the peaks of the pulses would no longer be distinguishable due to the ISI. 

In the next section, we consider two design problems. First, we consider the desig 
of transmitting and receiving filters in the presence of channel distortion when the channe 
characteristics are known. Second, we consider the design of special filters, called channe 
equalizers, that automatically and adaptively correct for the channel distortion when th 
channel characteristics, i.e., |C(f)| and 6,(f), are unknown. 





(10.5.2 


10.5.1 Design of Transmitting and Receiving Filters for a Known Channel 


In this section, we assume that the channel frequency-response characteristic C(f) 
known; thus, we consider the problem of designing a transmitting filter and a receivir 
filter that maximize the SNR at the output of the receiving filter and result in zero IS 
Figure 10.20 illustrates the overall system under consideration. 
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Figure 10.19 Effect of 
channel distortion in (a) channel 





(b) input and (b) channel output. 
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Figure 10.20 System configuration for the design of G;(f) and Ga(f). 


For the signal component, we must satisfy the condition 


Gr(f)C(A)GRr(f) = Xol fje "O, = | fl < Ww, (10.5.3) 


where X;c(f) is the desired raised cosine spectrum that yields zero ISI at the sampling 
instants and f is a time delay, which is necessary to ensure the physical realizability of the 
transmitter and receiver filters. 
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The noise at the output of the receiving filter may be expressed as 
o0 
w(t) = n(t — T)gr(t) dt, (10.5.4) 
— 0 


where n(t) is the input to the filter. The noise n(t) is assumed to be zero-mean Gaussian. 
Hence, w(t) is zero-mean Gaussian, with a power spectral density 


Swf) = HAIER, (10.5.5) 


where S,,(f) is the spectral density of the noise process n (t). 
For simplicity, we consider binary PAM transmission. Then, the sampled output of 
the matched filter is 





Ym = X0Am + Wm = am + Wm, (10.5.6) 


where xo is normalized to unity, a, = +d, and Wm represents the noise term that is zero- 
mean Gaussian with variance 


foe) 
o2 = f SAIGR AE df. 40.57) 
=00 
Consequently, the probability of error is 
1 [ a 2 d? 
P, = — e*%Pdy=Q| /—). 10.5.8 
? V2 Jais os i l 


Now, suppose that we select the filter at the transmitter to have the frequency response 


vV X(f) e`i2T fto 
C(f) 


where tọ is a suitable delay to ensure causality. Then, the cascade of the transmit filter anc 
the channel results in the frequency response 


Gr (ACES) =V Xr f) e774. - (10.5.10 


In the presence of additive white Gaussian noise, the filter at the receiver is designed to be 
matched to the received signal pulse. Hence, its frequency response is 


Gr(f) = v X(f) a a (10.5.11 


where t, is an appropriate delay. 
Let us compute the SNR d*/o2 for these filter characteristics. The noise variance is 


Gr(f) = (10.5.9 











co Ww 
oy = zf IGr(f) Aaf = mal Xie(f) = a (10.5.12 
00 w 


The average power transmitted is 


Ee) [® , d f” ES) 
Gs [oan w ICP 


; 10.5.13 
F df ( 
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W Xc(f) I 
2 = -=d ; 
d If, IC(f)? f 


Therefore, the SNR d*/o2 is given as 


hence, 








d _2PaT l t Xn(f) y ale 


2 No WU wIiCAP 


W 
pie J Xe) ap 
-W 


We note that the term 
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(10.5.14) 


(10.5.15) 


(10.5.16) 


with |C(f)| < 1 for|f| < W, represents the loss in performance in dB of the communica- 
tion system due to channel distortion. When the channel is ideal, |C (f)| = 1 for |f| < W; 
hence, there is no performance loss. We also note that this loss is entirely due to amplitude 
distortion in the channel because the phase distortion has been totally compensated by the 


transmit filter. 


Example 10.5.1 


Determine the magnitude of the transmitting and receiving filter characteristics for a binary 
communication system that transmits data at a rate of 4800 bits/sec over a channel with fre- 


quency response 





1 
CA =; flew. 
1+ (5) 
where W = 4800 Hz. The additive noise is zero-mean white Gaussian with a spectral density 
No/2. 
Solution Since W = 1/T = 4800, we use a signal pulse with a raised cosine spectrum and 
a = 1. Thus, 
T 
Xic(f) = 5 [1 +cos(nT|f|)] 
=T cos? lf | i 
9600 
Then, 


2 
erent (rfia ($) feos mA, |f| < 4800 Hz 


I\GR(f)| = VT cos A, |f| < 4800 Hz 


and 


IGr(f)|=!Ga(f)|=0  for|f|> 4800 Hz. 























582 Digital Transmission through Bandlimited Channels Chapter 19 


10.5.2 Channel Equalization 


In the preceding section, we described the design of transmitting and receiving filters for 
digital PAM transmission when the frequency-response characteristics of the channel are 
known. Our objective was to design these filters for zero ISI at the sampling instants. This 
design methodology is appropriate when the channel is precisely known and its character- 
istics do not change with time. 

In practice, we often encounter channels whose frequency-response characteristics 
are either unknown or change with time. For example, in data transmission over the dial- 
up telephone network, the communication channel will be different every time we dial a 
number because the channel route will be different. Once a connection is made, however, 
the channel will be time invariant for a relatively long period of time. This is an example of 
a channel whose characteristics are unknown a priori. Examples of time-varying channels 
are radio channels, such as ionospheric propagation channels. These channels are charac- 
terized by time-varying frequency response characteristics. When the channel characteris- 
tics are unknown or time varying, the optimization of the transmitting and receiving filters, 
as described in Section 10.5.1, is not possible. 

Under these circumstances, we may design the transmitting filter to have a square- 
root raised cosine frequency response, i.e., 


_ | VX e, fW 
arD = y ifl> Ww? 


and the receiving filter, with frequency response Gr(f), to be matched to Gr(f) 
Therefore, 


IGr(f)IIGr(f)| = Xre(f).- (10.5.17 


Then, due to channel distortion, the output of the receiving filter is 


oo 


y(t) = X` agx(t —nT) + w(t), (10.5.18 


n=—00 


where x(t) = gr(t)*c(t) *gr(t). The filter output may be sampled periodically to produc 
the sequence 


00 
Ym = > AnXm—n + Wm 


n=—0O 
+00 
= X0Qm + > AnXm-n + Wm, (10.5.16 
n=—0O 
n#m 


where x, = x(nT),n = 0, +1, +2,.... The middle term on the right-hand side « 
Equation (10.5.19) represents the ISI. 
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Figure 10.21 Equivalent discrete-time channel filter. 


In any practical system, it is reasonable to assume that the ISI affects a finite number 
of symbols. Hence, we may assume that x, = O for n < —L, and.n > Lo, where Lı and 
Lz are finite, positive integers. Consequently, the ISI observed at the output of the receiv- 
ing filter may be viewed as being generated by passing the data sequence {a,,} through an 
FIR filter with coefficients {x,, -—L1 < n < L2}, as shown in Figure 10.21. This filter is 
called the equivalent discrete-time channel filter. Since its input is the discrete information 
sequence (binary or M-ary), the output of the discrete-time channel filter may be character- 
ized as the output of a finite-state machine with L = Lı + L23 states, corrupted by additive 
Gaussian noise. Hence, the noise-free output of the filter is described by a trellis having 
ME states. 


Maximum-Likelihood Sequence Detection. The optimum detector for the 
information sequence {a,,}, which is based on the observation of the received sequence 
{Ym} and given by Equation (10.5.19), is an ML sequence detector. The detector is akin to 
the ML sequence detector described in the context of detecting partial response signals that 
have controlled ISI. The Viterbi algorithm described in Section 13.3.2 provides a method 
for searching through the trellis for the ML signal path. To accomplish this search, the 
equivalent channel filter coefficients {x,,} must be known or measured by some method. At 
each stage of the wellis search, there are M} surviving sequences with M“ corresponding 
Euclidean distance path metrics. 

Due to the exponential increase in the computational complexity of the Viterbi algo- 
rithm with the span (length L) of the ISI, this type of detection is practical only when M 
and L are small. For example, in mobile cellular telephone systems that employ digital 
transmission of speech signals, M is usually selected to be small, i.e., M = 2 or 4, and 
2 < L < 5. In this case, the ML sequence detector may be implemented with reason- 
able complexity. However, when M and L are large, the ML sequence detector becomes 
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impractical. In such a case, other more practical but suboptimum methods are used to detec 
the information sequence {am} in the presence of ISI. Nevertheless, the performance of th 
ML sequence detector for a channel with ISI serves as a benchmark for comparing its per 
formance with that of suboptimum methods. Two suboptimum methods are described nex: 


Linear Equalizers. For channels whose frequency-response characteristics ar 
unknown, we may employ a linear filter with adjustable parameters, which are adjusted t 
compensate for the channel distortion. Such a filter is called an equalizer. 

First, we consider the design characteristics for a linear equalizer from a frequency 
domain viewpoint. Figure 10.22 shows a block diagram of a system that employs a linez 
filter as a channel equalizer. 

The demodulator consists of a receiving filter with the frequency response GR( f) i 
cascade with a channel equalizing filter that has a frequency response Gg(f). Since GR (f 
is matched to Gr (f) and they are designed so that their product satisfies Equation (10.5.17 
|Gz(f)| must compensate for the channel distortion. Hence, the equalizer frequenc 
response must equal the inverse of the channel response, i.e., 


eee ae 
cf) Ich 


where |Ge(f)| = 1/|C(f)| and the equalizer phase characteristic 0g (f) = —@(f 
In this case, the equalizer is said to be the inverse channel filter to the channel response. 

We note that the inverse channel filter completely eliminates ISI caused by the chai 
nel. Since it forces the ISI to be zero at the sampling times t = nT, the equalizer is calle 
a zero-forcing equalizer. Hence, the input to the detector is of the form 


Ge(f) = IA | f| < W, (10.5.2 


Ym = An + Wm, 


where wm is the noise component, which is zero-mean Gaussian with a variance 


| Sn(fPIGRA)PIGEA)P af 


W Sal PAXP 
= —=— df, 10.5.2 
S IC(f)/? f i 





ee Transmitting Ch 1 Receiving Equali To detecto 
_— filter ——>- Cf) filter e O tT |_ > 
Gr(f) Gr(f) g 
Noise 
n(t) 


Figure 10.22 Block diagram of a system with equalizer. 
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in which S,(f) is the power spectral density of the noise. When the noise is white, S, (f) = 
No/2 and the variance becomes 


2 No fY IXP 
o? = o f Tel cone a (10.5.22) 


Example 10.5.2 


The channel given in Example 10.5.1 is equalized by a zero-forcing equalizer. Assuming that 
the transmitting and receiving filters satisfy Equation (10.5.17), determine the value of the 
noise variance at the sampling instants and the probability of error. 


Solution When the noise is white, the variance of the noise at the output of the zero-forcing 
equalizer (input to the detector) is given by Equation (10.5.22). Hence, 


2 No [™ IXe) 


= ——— d 
w= y CRY 


-f [1+ (fW)’] cos? IT 4 df 


1 
= No f (1 + x) cos? Heig 
0 2 


2 1 
=(3-a)™ 


The average transmitted power is 


=j d? 

Py = EDE f” IGr( fds 
— 1)d? 

= oe IXe(f) af 
(M? — 1)d? 
eee 


The general expression for the probability of error is given as 


p = 2M = Do 3PyT 
SS M (M2 — 1)(2/3 — 1/m2)No | ` 











If the channel were ideal, the argument of the Q-function would be r AA . Hence, the loss 
in performance due to the nonideal channel is given by the factor 2 (2/3 — 4) = 1.133 or 


0.54 dB. a 


Let us now consider the design of a linear equalizer from a time-domain viewpoint. 
We noted previously that in real channels, the ISI is limited to a finite number of samples, L 
samples. As a consequence, in practice for example, the channel equalizer is approximated 
by a finite-duration impulse response (FIR) filter, or transversal filter, with adjustable tap 
coefficients {c,} as illustrated in Figure 10.23. The time delay t between adjacent taps may 
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Unequalized 
input 







Equalized 
output 


“Al gorithm for tap 
gain adjustment z 


a Figure 10.23 Linear transversal filter. 


be selected as large as T, the symbol interval, in which case the FIR equalizer is called a 
symbol-spaced equalizer. In this case, the input to the equalizer is the sampled sequence 
given by Equation (10.5.19). However, we note that when 1/T < 2W, frequencies in the 
received signal that are above the folding frequency 1/T are aliased into frequencies below 
1/T. In this case, the equalizer compensates for the aliased channel-distorted signal. 

On the other hand, when the time delay t between adjacent taps is selected such that 
1/t > 2W > 1/T, no aliasing occurs; hence, the inverse channel equalizer compensates 
for the true channel distortion. Since t < T, the channel equalizer is said to have frac- 
tionally spaced taps, and it is called a fractionally spaced equalizer. In practice, t is often 
selected as t = T/2. Notice that, in this case, the sampling rate at the output of the filter 
Gr(f) is =. 

The impulse response of the FIR equalizer is 


N 


gelt) = >) cròt —nt), (10.5.23) 
n=—N 


and the corresponding frequency response is 


N 
"CEQ Ys ene OR (10.5.24 
n=—N 


where {cn} are the (2N + 1) equalizer coefficients and N is chosen sufficiently large 
so that the equalizer spans the length of the ISI, i.e., 2N + 1 > L. Since X(f) = 
Gr (f)C(f) Gr (f) and x(t) is the signal pulse corresponding to X (f), the equalized out: 
put signal pulse is 
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N 


q(t)= J. eax —nt). (10.5.25) 


n=—N 


The zero-forcing condition can now be applied to the samples of q(t) taken at times 
t = mT. These samples are 


N 


q(mT) = J` cax(mT —nt),  m=0,41,...,4N. (10.5.26) 
n=—N 


Since there are 2N + 1 equalizer coefficients, we can control only 2N + 1 sampled values 
of q(t). Specifically, we may force the conditions 


1,m=0 


0, m= +1, +2,..., 4N’ (10.5.27) 


N 
q(mT) = oy CnxX(mT — nt) = | 


n=—N 


which may be expressed in matrix form as Xc = q, where X isa (2N + 1) x (2N + D- 
matrix with elements {x(mT — nt)}, c is the (2N + 1) coefficient vector, and q is the 
(2N + 1) column vector with one nonzero element. Thus, we obtain a set of 2N + 1 linear 
equations for the coefficients of the zero-forcing equalizer. 

We should emphasize that the FIR zero-forcing equalizer does not completely elim- 
inate ISI because it has a finite length. However, as N is increased, the residual ISI can be 
reduced; in a limit as N —> œ, the ISI is completely eliminated. 


Example 10.5.3 
Consider a channel-distorted pulse x(t), at the input to the equalizér, given by the expression 
1 
x)=——, 
1+ (F 


where 1/T is the symbol rate. The pulse is sampled at the rate 2/T and equalized by a zero- 
forcing equalizer. Determine the coefficients of a five-tap zero-forcing equalizer. 


- 0v 
Solution According to Equation (10.5.27), the zero-forcing equalizer must satisfy the 
equations 


2 
q(mT) = J cax(mT —nT/2) = 


n=—2 


1, m=0 
0, m = 1, +2 ’ 


The matrix X with elements x(mT — nT /2) is given as 


Pe d ale. R 

5 10 17 26 37 

1 4? 4 L4 L 

2 5 0T 

=| 4 1 La 
X=|3213235 

Ae lll] 

~ | 7 0 5 2 

dpi iil 
-37 26 17 10 5 -~ 
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The coefficient vector ¢ and the vector q are given as 


C2 0 
C_j 0 
c=] © q=} 1 
Cj 0 
C2 0 


Then, the linear equations Xc = q can be solved by inverting the matrix X. Thus, we 
obtain 


Cop = Xq = —3 


One drawback to the zero-forcing equalizer is that it ignores the presence of addi. 
tive noise. As a consequence, its use may result in significant noise enhancement. This i: 
easily seen by noting that in a frequency range where C (f) is small, the channel equalize: 
Ge(f) = 1/C(f) compensates by placing a large gain in that frequency range. Conse 
quently, the noise in that frequency range is greatly enhanced. An alternative is to relax th 
zero ISI condition and select the channel equalizer characteristic such that the combinec 
power in the residual ISI and the additive noise at the output of the equalizer is minimized 
A channel equalizer that is optimized based on the minimum mean-square-error (MMSE 
criterion accomplishes the desired goal. 

To elaborate, let us consider the noise-corrupted output of the FIR equalizer 


which is 
N 


z(t)= >> cay(t—nt), (10.5.28 


n=—N 
where y(t) is the input to the equalizer, which is given by Equation (10.5.18). The outpu 
is sampled at times t = mT. Thus, we obtain 


N 
z(mT) = > cay(mT —nt). (10.5.29 
n=—N 


The desired response sample at the output of the equalizer at t = mT is the trans 
mitted symbol am. The error is defined as the difference between am and z(mT ). Then, th 
mean square error between the actual output sample z(mT ) and the desired values am is 


MSE = E(z(mT) — am)? 


N 
=E ( D cCny(mT — nt) — a) 
n=—N 


2 


N N N 
= J, DE enceRvin —k]—2 $ ceRavlk] + E (a2), (10.5.3 
k=—N 


n=—N k=—N 





Section 10.5 System Design in the Presence of Channel Distortion 589 
where the correlations are defined as 


Ry[n — k] = E (y(mT —nt)y(mT — kt)) 


and 


Raylk] = E(y(mT — kt)a,,) (10.5.31) 


and the expectation is taken with respect to the random information sequence {a,,} and the 
additive noise. 

The MMSE solution is obtained by differentiating Equation (10.5.30) with respect to 
the equalizer coefficients {c,,}. Thus, we obtain the necessary conditions for the MMSE as 


N 
D CnRry[n — k] = Rarik], k =0,+1,2,...,4N. (10.5.32) 
n=—N 


There are (2N + 1) linear equations for the equalizer coefficients. In contrast to the zero- 
forcing solution previously described, these equations depend on the statistical properties 
(the autocorrelation) of the noise as well as the ISI through the autocorrelation Ry [7]. 

In practice, we would not normally know the autocorrelation Ry[n] and the cross 
correlation Ray [n]. However, these correlation sequences can be estimated by transmitting 
a test signal over the channel and using the time average estimates 


K 
Rytn) = $ yk — nc)ykT) 
k=1 


and 


K 

a 1 

Rar[n] = E > y(kT — nt)ax (10.5.33) 
k=1 


in place of the ensemble averages to solve for the equalizer coefficients given by Equation 
(10.5.32). 


Adaptive Equalizers. We have shown that the tap coefficients of a linear equal- 
izer can be determined by solving a set of linear equations. In the zero-forcing optimiza- 
tion criterion, the linear equations are given by Equation (10.5.27). On the other hand, if the 
optimization criterion is based on minimizing the MSE, the optimum equalizer coefficients 
are determined by solving the set of linear equations given by Equation (10.5.32). 

In both cases, we may express the set of linear equations in the general matrix form 


Be=d, (10.5.34) 


where B is a (2N + 1) x (2N + 1) matrix, c is a column vector representing the 2N + 1 
equalizer coefficients, and d is a (2N + 1)-dimensional column vector. The solution of 
Equation (10.5.34) yields 
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Copt = Bd. (10.5.35 


In practical implementations of equalizers, the solution of Equation (10.5.34) for th, 
optimum coefficient vector is usually obtained by an iterative procedure that avoids thi 
explicit computation of the inverse of the matrix B. The simplest iterative procedure į; 
the method of steepest descent, in which one begins by choosing arbitrarily the coefficien 
vector c, e.g., Cg. This initial choice of coefficients corresponds to a point on the criterion 
function that is being optimized. For example, in the case of the MSE criterion, the initia 
guess cg corresponds to a point on the quadratic MSE surface in the (2N + 1)-dimensiona 
space of coefficients. The gradient vector, defined as go, which is the derivative of the MSI 
with respect to the 2N + 1 filter coefficients, is then computed at this point on the criterio 
surface and each tap coefficient is changed in the direction opposite to its correspondin; 
gradient component. The change in the jth tap coefficient is proportional to the size of th 
jth gradient component. 

For example, the gradient vector, denoted as g,, for the MSE criterion, found b 
taking the derivatives of the MSE with respect to each of the 2N + 1 coefficients, is 


g, = Be, —d, k= l, eri (10.5.3 
Then the coefficient vector cx is updated according to the relation 
Ck+1 = Ck —A By, (10.5.37 


where A is the step-size parameter for the iterative procedure. To ensure convergence ¢ 
the iterative procedure, A is chosen to be a small positive number. In such a case, th 
gradient vector g, converges toward zero, i.e., gą — 0 as k -> oo, andthe coefficient vec 
tor Ck — Copt, as illustrated in Figure 10.24 for two-dimensional optimization. In genera 
convergence of the equalizer tap coefficients to Copt cannot be attained in a finite numbe 
of iterations with the steepest descent method. However, the optimum solution Cop ca 
be approached as closely as desired in a few hundred iterations. In digital communic: 
tion systems that employ channel equalizers, each iteration corresponds to a time interv: 
for sending one symbol; hence, a few hundred iterations to achieve convergence to Co 
corresponds to a fraction of a second. 

Adaptive channel equalization is required for channels whose characteristics chang 
with time. In such a case, the ISI varies with time. The channel equalizer must track suc 
time variations in the channel response and adapt its coefficients to reduce the ISI. In th 
context of the preceding discussion, the optimum coefficient vector Copt varies with tin 
due to time variations in the matrix B and, for the case of the MSE criterion, time variatio 
in the vector d. Under these conditions, the iterative method previously described can t 
modified to use estimates of the gradient components. Thus, the algorithm for adjustir 
the equalizer tap coefficients may be expressed as 


kti = êk — A &, (10.5.3 


where ĝ, denotes an estimate of the gradient vector g, and é, denotes the estimate of tl 
tap coefficient vector. 
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ci 


Figure 10.24 Example of 
convergence characteristics of 
a gradient algorithm. 


In the case of the MSE criterion, the gradient vector g, given by Equation (10.5.36) 
may also be expressed as (see Problem 10.30) 


8r = Eleri). 
An estimate of the gradient vector at the kth iteration is computed as 
Êr = —ek Yk (10.5.39) 


where ex denotes the difference between the desired output from the equalizer at the kth 
time instant and the actual output z(kT), and y, denotes the column vector of 2N + 1 
received signal values contained in the equalizer at time instant k. The error signal is 
expressed as ; 


ek = Ak — Zk, (10.5.40) 


where zg = z(kT ) is the equalizer output given by Equation (10.5.29) and a, is the desired 
symbol. Hence, by substituting Equation (10.5.39) into Equation (10.5.38), we obtain the 
adaptive algorithm for optimizing the tap coefficients (based on the MSE criterion) as 


Chat = +A CRY (10.5.41) 


Since an estimate of the gradient vector is used in Equation (10.5.41), the algorithm is 
called a stochastic gradient algorithm. It is also known as the least mean square (LMS) 
algorithm. 

A block diagram of an adaptive equalizer that adapts its tap coefficients according 
to Equation (10.5.41) is illustrated in Figure 10.25. Note that the difference between the 
desired output a, and the actual output z, from the equalizer is used to form the error 
signal ex. This error is scaled by the step-size parameter A, and the scaled error signal 
Ae, multiplies the received signal values {y(kT — nt)} at the 2N + 1 taps. The products 
Aexy(kT — nt) at the (2N + 1) taps are then added to the previous values of the tap coef- 
ficients to obtain the updated tap coefficients, according to Equation (10.5.41). This com- 
putation is repeated for each received symbol. Thus, the equalizer coefficients are updated 
at the symbol rate. 
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Figure 10.25 Linear adaptive equalizer based on the MSE criterion. 


Initially, the adaptive equalizer is trained by the transmission of a known pseudorai 
dom sequence {a,,} over the channel. At the demodulator, the equalizer employs the know 
sequence to adjust its coefficients. Upon initial adjustment, the adaptive equalizer switch: 
from a training mode to a decision-directed mode, in which case the decisions at the outp 
of the detector are sufficiently reliable so that the error signal is formed by computing tt 
difference between the detector output and the equalizer output, i.e., 


ek = Õk — Zk, (10.5.4: 


where a, is the output of the detector. In general, decision errors at the output of the detect: 
occur infrequently; consequently, such errors have little effect on the performance of tl 
wacking algorithm given by Equation (10.5.41). 

A rule of thumb for selecting the step-size parameter to ensure convergence and got 
tracking capabilities in slowly varying channels is 


1 


[oe eek eee 10.5.4 
5(2N + 1)Pp ( 


where Pr denotes the received signal-plus-noise power, which can be estimated from tl 
received signal. 

The convergence characteristics of the stochastic gradient algorithm in Equati 
(10.5.41) is illustrated in Figure 10.26. These graphs were obtained from a computer sit 
ulation of an 11-tap adaptive equalizer operating on a channel with a rather modest amou 
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Figure 10.26 Initial convergence 
characteristics of the LMS algorithm 
with different step sizes. 
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of ISI. The input signal-plus-noise power Pr was normalized to unity. The rule of thumb 
given in Equation (10.5.43) for selecting the step size gives A = 0.018. The effect of mak- 
ing A too large is illustrated by the large jumps in MSE, as shown for A = 0.115. As 
A is decreased, the convergence is slowed somewhat, but a lower MSE is achieved; this 
indicates that the estimated coefficients are closer to Copt- 

Although we have described the operation of an adaptive equalizer that is optimized 
on the basis of the MSE criterion, the operation of an adaptive equalizer based on the zero- 
forcing method is very similar. The major difference lies in the method for estimating the 
gradient vectors g, at each iteration. A block diagram of an adaptive zero-forcing equalizer 
is shown in Figure 10.27. 


Decision-Feedback Equalizer. The linear filter equalizers previously described 
are very effective on channels, such as wireline telephone channels, where the ISI is not 
severe. The severity of the ISI is directly related to the spectral characteristics, and is not 
necessarily related to the time span of the ISI. For example, consider the ISI resulting from 
two channels illustrated in Figure 10.28. The time span for the ISI in channel A is five 
symbol intervals on each side of the desired signal component, which has a value of 0.72. 
On the other hand, the time span for the ISI in channel B is one symbol interval on each 
side of the desired signal component, which has a value of 0.815. The energy of the total 
response is normalized to unity for both channels. 

In spite of the shorter ISI span, channel B results in more severe ISI. This is evi- 
denced in the frequency-response characteristics of these channels, which are shown in 
Figure 10.29. We observe that channel B has a spectral null (the frequency response 
C(f) = 0 for some frequencies in the band |f| < W) at f = 1/2T, whereas this does 
not occur in the case of channel A. Consequently, a linear equalizer will introduce a large 
gain in its frequency response to compensate for the channel null. Thus, the noise in chan- 
nel B will be enhanced much more than in channel A. This implies that the performance 
of the linear equalizer for channel B will be significantly worse than that for channel A. 
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Figure 10.27 An adaptive zero-forcing equalizer. 
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Figure 10.29 Amplitude spectra for (a) channel A shown in Figure 10.28(a) and (b) channel B shown in 
Figure 10.28(b). 


This fact is borne out by the computer simulation results for the performance of the linear 
equalizer for the two channels, as shown in Figure 10.30. Hence, the basic limitation of a 
linear equalizer is that it performs poorly on channels having spectral nulls. Such channels 
are often encountered in radio communications, such as ionospheric transmission at fre- 


quencies below 30 MHz, and mobile radio channels, such as those used for cellular radio 
communications. 
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Figure 10.30 Error-rate performance of linear MSE equalizer. 
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Feedback y ; 
filter Figure 10.31 Block diagram 


of DFE. 


A decision-feedback equalizer (DFE) is a nonlinear equalizer that employs previou 
decisions to eliminate the ISI caused by previously detected symbols on the current symbc 
to be detected. A simple block diagram for a DFE is shown in Figure 10.31. The DF! 
consists of two filters. The first filter is called a feedforward filter, and it is generally 
fractionally spaced FIR filter with adjustable tap coefficients. This filter is identical in forr 
to the linear equalizer previously described. Its input is the received filtered signal y(t. 
The second filter is a feedback filter. It is implemented as an FIR filter with symbol-space 
taps having adjustable coefficients. Its input is the set of previously detected symbols. Th 
output of the feedback filter is subtracted from the output of the feedforward filter to fori 
the input to the detector. Thus, we have 


Nj N2 
Zm =) cny(mT — nt) — >) bndim—ns (10.5.4 
=0 n=1 
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where {c,} and {b,} are the adjustable coefficients of the feedforward and feedback filters, 
respectively, Ĝm-n, n = 1,2,..., N2 are the previously detected symbols, N, + 1 is the 
length of the feedforward filter, and N2 is the length of the feedback filter. Based on the 
input Zm, the detector determines which of the possible transmitted symbols is closest in 
distance to the input signal zm. Thus, it makes its decision and outputs @,,. What makes the 
DFE nonlinear is the nonlinear characteristic of the detector, which provides the input to 
the feedback filter. 

The tap coefficients of the feedforward and feedback filters are selected to optimize 
some desired performance measure. For mathematical simplicity, the MSE criterion is 
usually applied and a stochastic gradient algorithm is commonly used to implement an 
adaptive DFE. Figure 10.32 illustrates the block diagram of an adaptive DFE whose tap 
coefficients are adjusted by means of the LMS stochastic gradient algorithm. Figure 10.33 
illustrates the probability of error performance of the DFE, obtained by computer simula- 
tion, for binary PAM transmission over channel B. The gain in performance relative to that 
of a linear equalizer is clearly evident. 

We should mention that decision errors from the detector that are fed to the feedback 
filter have a small effect on the performance of the DFE. In general, there is a small loss in 
performance of 1 to 2 dB at error rates below 1072; this is due to decision errors, but these 
decision errors in the feedback filters are not catastrophic. The effect of decision errors in 
the feedback filter for channel B is also shown in Figure 10.33. 
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Figure 10.32 Adaptive DFE. 
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Figure 10.33 Performance of DFE with and without error propagation. 


Although the DFE outperforms a linear equalizer, it is not the optimum equalizer 
from the viewpoint of minimizing the probability of error. As indicated previously, the 
optimum detector in a digital communication system in the presence of ISI is an ML 
symbol sequence detector. Such a detector is particularly appropriate for channels witt 
severe ISI when the ISI spans only a few symbols. For example, Figure 10.34 illustrates 
the error probability performance of the Viterbi algorithm for a binary PAM signal trans- 
mitted through channel B. For purposes of comparison, we also illustrate the probability 
of error for a decision-feedback equalizer. Both results were obtained by computer simula- 
tion. We observe that the performance of the ML sequence detector is about 4.5 dB bette: 
than that of the DFE at an error probability of 1074. Hence, this is one example where 
the ML sequence detector provides a significant performance gain on a channel that has < 
relatively short ISI span. 

In conclusion, we mention that adaptive equalizers are widely used in high-speec 
digital communication systems for radio channels and telephone channels. For example 
high-speed telephone line modems (with a bit rate above 2400 bps) generally include ar 
adaptive equalizer that is implemented as an FIR filter with coefficients that are adjustec 
based on the MMSE criterion. Depending on the data speed, the equalizer typically span: 
between 20 and 70 symbols. The LMS algorithm given by Equation (10.5.41) is usually 
employed for the adjustment of the equalizer coefficients adaptively. 
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Figure 10.34 Performance of Viterbi detector and DFE for channel B. 


10.6 SUMMARY AND FURTHER READING 


In this chapter, we focused on the transmission of digital signals in bandlimited AWGN 
channels. We began with the characterization of bandlimited channels as linear bandlimited 
filters and demonstrated that such channel characteristics generally result in signal distor- 
tion. In digital signal transmission, channel distortion results in intersymbol interference. 

In order to limit the effects of channel distortion on the transmitted signals, we con- 
sidered the design of bandlimited signals for transmission in bandlimited channels. First, 
we demonstrated that the power spectrum of a digitally modulated signal, such as PAM, 
PSK, and QAM, is a function of the spectrum Gr (f) of the basic signal pulse gr (t) that 
is used at the transmitter to limit the bandwidth of the transmitted signal. The power spec- 
trum of the transmitted signal is also a function of the spectral characteristics of the data 
sequence {an}. We noted that when the data sequence has a zero mean, i.e., E(a,) = 0, 
the occurrence of the impulses (discrete frequency components at multiples of the symbol 
rate) in the power spectrum of the transmitted is completely avoided. This condition is eas- 
ily satisfied in the signal point constellations for PAM, PSK, and QAM that were described 
in Chapter 8. 

Then, we turned our attention to the design of bandlimited pulses gr (t) for transmis- 
sion in a bandlimited channel. Two distrinct methods were described. In the first method, 
we imposed the constraint that, in transmitting the signal through an ideal bandlimited 
channel, the received signal be free of any ISI. Under the constraint of zero ISI, Nyquist 
(1928) proved that the maximum data rate that can be transmitted in an ideal channel of 
bandwidth W is 2W symbols per second, i.e., 1/7 = 2W where T is the signal interval. 
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The symbol rate of 2W symbols per second is called the Nyquist rate. The only signal pulse 
gr(t) that can be used to achieve the Nyquist rate under the constraint of zero ISI is a sinc 
pulse whose spectrum is limited to W Hz and is constant within this bandwidth. However, 
the sinc pulse is not physically realizable. We demonstrated that it is possible to design 
physically realizable signal pulses that satisfy the zero ISI constraint, provided that the 
symbol wansmission rate 1/T satisfies the condition that 1/T < 2W. Signal pulses having 
a raised cosine spectrum were shown to possess this desirable property. Thus, the condition 
of zero ISI is satisfied at the cost of reducing the symbol rate below the Nyquist rate. 

By relaxing the constraint of zero ISI at the sampling instants, we demonstrated that 
a variety of bandlimited signal pulses can be designed that are bandlimited to W Hz and 
achieve the Nyquist rate of 2W symbols per second. Two examples that were described in 
detail are the duobinary and modified duobinary signal pulses, in which the ISI exists over 
two symbols. These signal types belong to the general class of so-called partial response 
signals. We observed that the primary undesirable characteristic in using partial response 
signal pulses for transmission in a bandlimited channel is the complexity in the implemen- 
tation of the optimum maximum-likelihood (ML) detector, which is a sequence detector 
as opposed to a simple symbol-by-symbol detector. The ML sequence detector is usually 
implemented by use of the Viterbi algorithm, which is described in Chapter 13. However, 
we showed that by accepting a modest penalty in the performance of the detector, a symbol- 
by-symbol detector can be easily implemented for the duobinary and modified duobinary 
signal pulses. 

Having designed bandlimited signal pulses for transmission in ideal bandlimited 
channels, the problem that is encountered in practice is that channel distortion and ISI 
results from transmission in nonideal channels. To compensate for such channel distortion, 
an additional filter or computational device is typically employed in the receiver of the 
communication system. Such a device is called an equalizer. If the channel is time invari- 
ant, its characteristics can be measured and the equalizer can be designed as a fixed (time- 
invariant) filter. However, if the channel impulse response varies slowly in time (compared 
to the symbol rate), the equalizer can be designed to adapt to the slow time variations 
of the channel characteristics. Two of the most commonly used equalizers are the lin- 
ear transversal (tapped-delay-line) filter and the decision-feedback filter, which consists of 
two separate tapped-delay-line filters. The optimum equalizer is based on the use of the 
ML criterion and is implemented efficiently by the Viterbi algorithm (VA) 

The pioneering work on signal design for bandwidth-constrained channels was done 
by Nyquist (1928). The use of binary partial response signals was originally proposed in 
the paper by Lender (1963) and was later generalized by Kretzmer (1966). The problem 
of optimum transmitter and receiver filter design was investigated by Gerst and Diamond 
(1961), Tufts (1965), Smith (1965), and Berger and Tufts (1967). 

Adaptive equalization for digital communication was introduced by Lucky (1965, 
1966). Widrow (1966) devised the LMS algorithm for adaptively adjusting the equalizer 
coefficients. 

The Viterbi algorithm was devised by Viterbi (1967) for the purpose of decod- 
ing convolutional codes, which are described in Chapter 13. Its use as the ML sequence 
detector for partial response signals and, more generally, for symbols corrupted by inter- 
symbol interference, was proposed and analyzed by Forney (1972) and Omura (1971). 
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A comprehensive treatment of adaptive equalization algorithms is given in the book by 
Proakis and Salehi (2008). 


APPENDIX 10A: POWER SPECTRUM OF MODULATED SIGNALS 


In this appendix, we derive the power spectrum of linearly modulated digital signals. We 
begin by deriving the power spectrum of the equivalent baseband signal, then, obtaining 
the power spectrum of the bandpass signal. 


10A.1 The Power Spectrum of the Baseband Signal 


As shown in Section 10.1.2, the equivalent baseband transmitted signal for a digital PAM, 
PSK, or QAM signal is represented in the general form as 


v(t) = J. angr(t —nT), (10A.1) 


n=—00 


where {a,} is the sequence of values selected from either a PAM, QAM, or PSK signal 
constellation corresponding to the information symbols from the source, and gy (t) is the 
impulse response of the transmitting filter. Since the information sequence {an} is random, 
v(t) is a sample function of a random process V(t). In this section, we evaluate the power 
density spectrum of V (t). Our approach is to derive the autocorrelation function of V (t) 
and then to determine its Fourier transform. 

First, the mean value of V (t) is 


EV) = J` E(an)gr(t —nT) 
=m, >. gr(t—nT), (10A.2) 


where ma is the mean value of the random sequence {an}. Note that although m, is a 
constant, the term }`„ gr(t — nT) is a periodic function with period T. Hence, the mean 
value of V(t) is periodic with period T. 

The autocorrelation function of V(t) is 


Rv(t+t,t) = E(V*(t)V(t+7T)) 


= $, dS E@san)gr(t—nT)gr(¢+t— mT). (10A.3) 


n=—0OO m=—CO 





In general, we assume that the information sequence {a,} is wide-sense stationary with 
autocorrelation sequence 








Raln] = E (až danym). (10A.4) 
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Hence, Equation (10A.3) may be expressed as 


Rvt+tt)= D> >> Ralm—ngr(t —nT)gr(¢ +t —mT) 


[e0] [ee] 


= D5 Rdm] Y` gr(t-nT)gr(t +t -nT —mT). 


m=—00 n=—00 


We observe that the second summation in Equation (10A.5), namely, 


foe) 
Yo art —nT)gr(t +t —nT — mT) 


n=—00 


Chapter 1 


(10A.5 


(10A.6 


is periodic with period T. Consequently, the autocorrelation function Ry(t + t, t) is peri 


odic in the variable t; i.e., 


Ryt+T+2,t+T) = Rv +t, 2). 


(10A.7 


Therefore, the random process V(t) has a periodic mean and a periodic autocorrelatior 


Such a random process is cyclostationary (see Definition 5.2.4). 


The power spectral density of a cyclostationary process can be determined by firs 
averaging the autocorrelation function Ry (t+ T, t) over a single period T and then comput 
ing the Fourier transform of the average autocorrelation function (see the Wiener—Khinchi 


Theorem in Section 5.2.5). Thus, we have 


E 1 T/2 
Rv(t) = T a Ry(t +1, t) dt 
-T/2 


co 


m=—00 n=—0O 
o0 —nT+T/2 
= >> Rain] sal J er(t)gr(t +t — mT) dt 
m=—©o n=—0o —nT—T/2 


7 Ly Ral] A gr(t)gr(t + t — mT) dt. 


fee) 


De Ral] 5 ie er(t —nT)gr(t +t —nT —mT)at 


(10A. 


We interpret the integral in Equation (10A.8) as the time-autocorrelation function of gr(i 


and define it as [see Equation (2.5.1)] 


R,(t) =f gr(t)gr(t + T) dt. 


With this definition, the average autocorrelation function of V (t) becomes 


Rv(t) = - XO Ralm)R,(t — mT). 


m=—0o 


(10A:! 


(10A.1 
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We observe that the expression for Ry(zt) in Equation (10A.10) has the form of a 
convolution sum. Hence the Fourier transform of Equation (10A.10) becomes 


Syf) = 5 Ry (tje t?" dt 


o0 


DS 00 
=a 5 Ratm) f R(t — mT J)e}? dr 
—00 


m=—00 
1 
= 58a AIGr AYP, (10A.11) 
where Sa (f) is the power spectrum of the information sequence {an}, defined as 
[oe] 
Saf) = >) Ralme Print (10A.12) 
m=—00 


and Gr (f) is the spectrum of the transmitting filter. |Gr (f)|? is the Fourier wansform of 
R,(t). 

The result in Equation (10A.11) illustrates the dependence of the power spectral den- 
sity Sy(f) of the transmitted signal on (a) the spectral characteristics Gr (f) of the trans- 
mitting filter and (b) the spectral characteristics S,(f) of the information sequence {an}. 
Both G7(f) and S,(f) can bedesigned to control the shape and form of the power spectral 
density of the transmitted signal. 


10A.2 The Power Spectrum of the Carrier Modulated Signals 


The relationship between the power spectrum of the baseband signal to the power spectrum 
of the bandpass signal is relatively simple. Let us consider the bandpass PAM signal as an 
example. The autocorrelation function of the bandpass signal 


u(t) = v(t) cos2n fot 
is 
Ry +1, t) = E(U(t)U (t +1)) 
= E(V (t) V(t + T))cos2n fit cos2n f(t + T) 
= Ry(t + T, t)cos2n fet cos2u fe(t +T). 


By expressing the product of the two cosine functions in terms of the cosine of the differ- 
ence plus the sum of the two angles, we obtain 


1 
Ryt+1,t) = vet + T, t)[cos 2x fet + cos2n f,(2t + T)]. 
Then, the average of Ry(t + t, t) over a single period T yields 


= 1- 
Ry(t)= a hv) cos 2T fot, (10A.13) 
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where the second term involving the double-frequency term averages to zero for each 


period of cos 4n fet. E 
The Fourier transform of Ry (t) yields the power spectrum of the bandpass signal as 


1 
Su(t) = qlsvf — fc) + Sv(f + fe]. (104.14 


Although the derivation that resulted in Equation (10A.14) was carried out for a bandpass 
PAM signal, the same expression applies to QAM and PSK. The three bandpass signal: 
differ only in the autocorrelation R,[m] of the sequence {a,} and, hence, in the powe: 
spectrum S,(f) of {an}. 


PROBLEMS 


10.1 In Example 10.1.1, the ideal channel of bandwidth W limits the transmitted signa 
energy that passes through the channel. The received signal energy as a function o 
the channel bandwidth is 


T WT +2 
$,(W) = i sin’ næ da 


(21)? J_wr œ? (1-a?) ” 
where a = fT. 


1. Evaluate (numerically) €, (W) for W = F x, 12, 2, 25, 3, and plot Saw) a 


a function of W. 
2. Determine the value of (W) in the limit as W — oo. For the computatior 
you may use the time-domain relation 





+00 


jim (W) = J 84 (t) dt. 


—00 
10.2 In a binary PAM system, the input to the detector is 
Ym = Am +m + im, 


where am = +1 is the desired signal, nm is a zero-mean Gaussian random variabl 
with variance o$, and i,, represents the ISI due to channel distortion. The ISI ten 
is a random variable which takes the values -3, 0, ; with probabilities j Ss ] 


respectively. Determine the average probability of error as a function of 0. 


10.3 In a binary PAM system, the clock that specifies the sampling of the correlati 
output is offset from the optimum sampling time by 10%. 


1. If the signal pulse used is rectangular, determine the loss in SNR due to tł 
mistiming. 

2. Determine the amount of intersymbol interference introduced by the mistimin 
and determine its effect on performance. 





Problems - 605 


10.4 The frequency-response characteristic of a lowpass channel can be approximated by 


10.5 


10.6 


10.8 


10.9 


_ fltacos2nft, lal<1, IfI <W 
C(f) = | 0, otherwise i 


where W is the channel bandwidth. An input signal s(t) whose spectrum is ban- 
dlimited to W/Hz, is passed through the channel. 


1. Show that the channel output is 
a 
y(t) = s(t) + 5 [s (t — to) + s (t + to)]. 


Thus, the channel produces a pair of echoes. 


2. Suppose the received signal y(t) is passed through a filter matched to s(t). Deter- 
mine the output of the matched filter att = kT, k = 0, +1, +2,..., where T is 
the symbol duration. 


3. What is the ISI pattern resulting from the channel if to = T? 


Show that a pulse having the raised cosine spectrum given by Equation (10.3.20) 
satisfies the Nyquist criterion given by Equation (10.3.7) for any value of the roll-off 
factor a. 


Show that for any value of œ, the raised cosine spectrum given by Equation (10.3.20) 
satisfies 
+00 
J Xr(f)df =1. 
—0o 
[Hint: Use the fact that X,c(f) satisfies the Nyquist criterion given by Equation 
(10.3.7).] 


Equation (10.3.7) gives the necessary and sufficient condition for the spectrum X (f) 
of the pulse x (t) that yields zero ISI. Prove that, for any pulse that is bandlimited to 
IfI < 1/T, the zero ISI condition is satisfied if Re[X (f)] for f > 0 consists of a 
rectangular function plus an arbitrary odd function about f = 1/27, and Im[X (f)] 
is any arbitrary even function about f = 1/2T. 


A channel has a passband characteristic in the frequency range |f| < 1400 Hz. 


1. Select a symbol rate and a PAM signal constellation size to achieve a 9600 bps 
signal transmission. 


2. If a square-root raised cosine pulse is used for the transmitter pulse gy (t), select 
the roll-off factor. Assume that the channel has an ideal frequency-response char- 
acteristic. 


Design an M-ary PAM system that transmits digital information over an ideal 
channel with bandwidth W = 2400 Hz. The bit rate is 14,400 bits/sec. Specify the 
number of transmitted points, the number of received signal points using a duobi- 
nary signal pulse, and the required €, to achieve an error probability of 10°. The 
additive noise is zero-mean Gaussian with a power spectral density 1074 W/Hz. 
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10.10 When the additive noise at the input to the demodulator is colored, the filter matched 
to the signal no longer maximizes the output SNR. In such a case, we may consider 
the use of a prefilter that “whitens” the colored noise. The prefilter is followed by a 
filter matched to the prefiltered signal. Toward this end, consider the configuration 
shown in Figure P-10.10. 


1. Determine the frequency-response characteristic of the prefilter that whitens the 
noise. 
2. Determine the frequency-response characteristic of the filter matched to S(t). 


3. Consider the prefilter and the matched filter as a single “generalized matched 
filter.” What is the frequency-response characteristic of this filter? 


4. Determine the SNR at the input to the detector. 


r(t) = s(t) + n(t) |Prewhitening| f(#) = 5(¢) + n@) Filter 
z = filter =| matched O Detector 
n(t) is H,(f) to 5(t) Sample : 


colored noise att=T 






Figure P-10.10 











10.11 Consider the transmission of data via PAM over a channel that has a bandwidth of 
1500 Hz. Show how the symbol rate varies as a function of the excess bandwidth. 
In particular, determine the symbol rate for excess bandwidths of 25%, 33%, 50%, 
67%, 75%, and 100%. 


10.12 The binary sequence 10010110010 is the input to the precoder whose output is used 
to modulate a duobinary transmitting filter. Construct a table as in Table 10.2; show 
the precoded sequence, the transmitted amplitude levels, the received signal levels, 
and the decoded sequence. 


10.13 Repeat Problem 10.12 for a modified duobinary signal pulse. 


10.14 A precoder for a partial response signal fails to work if the desired partial response 
at n = Ois zero modulo M. For example, consider the desired response for M = 2: 





2, n=0 

l, n=1 
x(nT) = -1 n=2 

0, otherwise 


Show why this response cannot be precoded. 


10.15 A baseband digital communication system employs the signals shown in Figure 
P-10.15(a) for the transmission of two equiprobable messages. It is assumed that 
the communication problem studied here is a “one shot” communication problem, 

















Problems 5 607 


i.e., the messages are transmitted just once and no transmission takes place after- 
ward. The channel has no attenuation and the noise is AWG with power spectral 
density %, 


1. Find an appropriate orthonormal basis for the representation of the signals. 


2. In a block diagram, give the precise specifications of the optimal receiver using 
matched filters. Label the block diagram carefully. 


3. Find the error probability of the optimal receiver. 


4. Show that the optimal receiver can be implemented by using just one filter. [See 
the block diagram shown in Figure P-10.15(b).] What are the characteristics of 
the matched filter and the sampler and decision device? 


5. Now assume the channel is not ideal, but has an impulse response of c(t) = 
O(t) + i(t — T), Using the same matched filter as the previous part, design an 
optimal receiver. 


6. Assume that the channel impulse response is c(t) = ô(t) + ad(t — T), where 
a is a random variable uniformly distributed on [0, 1]. Using the same matched 
filter, design the optimal receiver. 


si() } 52(t) | 





Ee eae ee AT eel aye E E: 





i a t 0 T T t 
2 2 
(a) 
AWGN 
h(t) 
ae Modulator Matched Sampler and Output 
filter decision 


(b) 


Figure P-10.15 


10.16 Sketch and label the trellis for aduobinary signal waveform used in conjunction with 
the precoding given by Equation (10.4.2). Repeat this for the modified duobinary 
signal waveform with the precoder given by Equation (10.4.14). Comment on any 
similarities and differences. 
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10.17 A binary PAM signal is generated by exciting a raised-cosine roll-off filter with 
a 50% roll-off factor and then DSB-SC amplitude modulating it on a sinusoidal 
carrier, as illustrated in Figure P-10.17. The bit rate is 2400 bps. 


1. Determine the spectrum of the modulated binary PAM signal and sketch it. 


2. Draw the block diagram illustrating the optimum demodulator/detector for the 
received signal, which is equal to the transmitted signal plus additive white 
Gaussian noise. 





Sn et- nT) | Filter with 2 
n raised cosine AWGN rA) 
spectrum channel 
g(t) = — 
Carrier 
c(t) Figure P-10.17 


10.18 Anideal voice-band telephone line channel has a bandpass frequency-response char- 
acteristic spanning the frequency range 600-3000 Hz. 


1. Design an M = 4 PSK (quadrature PSK or QPSK) system for transmitting data 
at a rate of 2400 bits/sec and a carrier frequency fe = 1800. For spectral shaping, 
use a raised cosine frequency-response characteristic. Sketch a block diagram of 
the system and describe its functional operation. 


2. Repeat Part 1 if the bit rate is R = 4800 bits/sec. 


10.19 A voice-band telephone channel passes the frequencies in the band from 300-3300 
Hz. Wewant to design a modem that transmits at asymbol rate of 2400 symbols/sec, 
and our objective is to achieve 9600 bits/sec. Select an appropriate QAM signal con- 
stellation, carrier frequency, and the roll-off factor of a pulse with a raised cosine 
spectrum that utilizes the entire frequency band. Sketch the spectrum of the trans- 
mitted signal pulse and indicate the important frequencies. 


10.20 Consider a digital communication system that transmits information via QAM over 
a voice-band telephone channel at a rate 2400 symbols per second. The additive 
noise is assumed to be white and Gaussian. 


1. Determine the ,,,/No required to achieve an error probability of 1075 at 
4800 bps. 


2. Repeat Part 1 for a bit rate of 9600 bps. 
3. Repeat Part 1 for a bit rate of 19,200 bps. 


4. What conclusions do you reach from these results? 





ie A Ok i A lc a lO 
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10.21 Determine the bit rate that can be transmitted through a 4 kHz voice-band tele- 


phone (bandpass) channel if we use the following modulation methods: (a) binary 
PSK, (b) four-phase PSK, (c) eight-point QAM, (d) binary orthogonal FSK with 
noncoherent detection, (e) orthogonal four-FSK with noncoherent detection, and (f) 
orthogonal eight-FSK with noncoherent detection. For Parts (a) to (c), assume that 
the transmitter pulse shape has a raised cosine spectrum with a 50% roll-off. 


10.22 Consider the use of a (square-root) raised cosine signal pulse with a roll-off factor 


of unity for the transmission of binary PAM over an ideal bandlimited channel that 
passes the pulse without distortion. Thus, the transmitted signal is 


v(t) = D ak8gr(t — kT), 
k=—00 


where the signal interval T, = T /2 and the symbol rate doubles for no ISI. 


1. Determine the ISI values at the output of a matched filter demodulator. 


2. Sketch the trellis for the maximum likelihood sequence detector. Label the states. 


10.23 A binary antipodal signal is wansmitted over a nonideal bandlimited channel, which 


introduces ISI over two adjacent symbols. For an isolated transmitted signal pulse 
s(t), the (noise-free) output of the demodulator is ./€, at t = T and /@,/4 at 
t = 2T at zero fort = kT,k > 2, where ©, is the signal energy and T is the 
signaling interval. 


1. Determine the average probability of error assuming that the two signals are 
equally probable and the additive noise is white and Gaussian. 


2. By plotting the error probability obtained in Part 1 and the error probability 
for the case of no ISI, determine the relative difference in SNR of the error 
probability of 1076. 


10.24 Determine the frequency-response characteristics forthe RC circuit shown in Figure 


P-10.24. Also determine the expression for the envelope delay. 


| C Output 
ee Figure P-10.24 


10.25 Consider the RC lowpass filter shown in Figure P-10.24 where t = RC = 10~°. 
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1. Determine and sketch the envelope (group) delay of the filter as a function of 
frequency. 


2. Suppose that the input to the filter is a lowpass signal of bandwidth Af = 1 
kHz. Determine the effect of the RC filter on this signal. 


10.26 A microwave radio channel has a frequency-response 


C(f) =14+0.3cos2nfT. 


Determine the frequency-response characteristic for the optimum transmitting and 
receiving filters that yield zero ISI at a rate of 1/T symbols/sec and have an excess 
bandwidth of 50%. Assume that the additive noise spectrum is flat. 


10.27 An M = 4 PAM modulation is used for transmitting at a bit rate of 9600 bits/sec on 
a channel having a frequency response 


1 
Ch=——F 
1+ 7 7400 
where |f| < 2400, and C(f) = 0, otherwise. The additive noise is zero-mean, 
white Gaussian with power spectral density ba W/Hz. Determine the (magnitude) 
frequency-response characteristic of the optimum transmitting and receiving filters. 


10.28 Binary PAM is used to transmit information over an unequalized linear filter chan- 
nel. When a = 1 is transmitted, the noise-free output of the demodulator is 


0.3, m=1 
PRS 0.9, m=0 
™ ) 0.3, m=-—1 


0, otherwise 


1. Design a three-tap zero forcing linear equalizer so that the output is 


= l, m=0 
m=) 0, m= = 


2. Determine qm form = +2, +3 by convolving the impulse response of the equal- 
izer with the channel response. 


10.29 The transmission of a signal pulse with a raised cosine spectrum through a channel 
results in the following (noise-free) sampled output from the demodulator: 


—0.5, k=-2 
0.1, k=-1 
fs 1, k=0 
k=) -0.2, k=1 
0.05, k=2 


0, otherwise 


Se PALS eee eee ee ae ge 
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10.30 


10.31 


10.32 


10.33 


1. Determine the tap coefficients of a three-tap linear equalizer based on the zero- 
forcing criterion. 


2. For the coefficients determined in Part 1, determine the output of the equalizer 
for the case of the isolated pulse. Thus, determine the residual ISI and its span 
in time. 


Show that the gradient vector in the minimization of the MSE may be expressed as 
By = E (eY), 
where the error ex = ap — zk and the estimate of g, i.e., 
Be = kY 
satisfies the condition that E(g,) = g}. 


A nonideal bandlimited channel introduces ISI over three successive symbols. The 
(noise-free) response of the matched filter demodulator sampled at the sampling ’ 
time t = kT is 


Eb, k=0 
= _ J 098., k= +1 
J swse —kT)dt = 0.18, k= 42 
0, otherwise 


Determine the tap coefficients of a three-tap linear equalizer that equalizes the chan- 
nel (received signal) response to an equivalent partial response (duobinary) signal 


_[S, k=0,1 
Ye = 0, otherwise ` 


Determine the tap weight coefficients of a three-tap zero-forcing equalizer if the ISI 
spans three symbols and is characterized by the values x(0) = 1, x(—1) = 0.3, and 
x(1) = 0.2. Also, determine the residual ISI at the output of the equalizer for the 
optimum tap coefficients. 


In line-of-sight microwave radio transmission, the signal arrives at the receiver via 
two propagation paths: the direct path and a delayed path that occurs due to signal 
reflection from the surrounding terrain. Suppose that the received signal has the 
form 

r(t) =s(t) +as(t — T) +n(t), 


where s(t) is the transmitted signal, œ is the attenuation (œ < 1) of the secondary 
path, and n(t) is AWGN. 


1. Determine the output of the demodulator at t = T andt = 2T that employs a 
filter matched to s(t). 
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2. Determine the probability of error for a symbol-by-symbol detector if the trans- 
mitted signal is binary antipodal and the detector ignores the ISI. 


3. What is the error-rate performance of a simple (one-tap) DFE that estimates œ 
and removes the ISI? Sketch the detector structure that employs a DFE. 


10.34 Repeat Problem 10.32 and use the MMSE criterion for optimizing the tap coeffi- 
cients. Assume that the noise power spectrum is 0.1 W/Hz. 








10.35 Show that the covariance matrix C for the noise at the output of the matched filter 
for the duobinary pulse is given by Equation (10.4.35). 


10.36 A wireline channel of length 1000 km is used to transmit data via binary PAM. 
Regenerative repeaters are spaced 50 km apart along the system. Each segment of 
the channel has an ideal (constant) frequency response over the frequency band 
0 < f < 1200 and an attenuation of 1 dB/km. The channel noise is AWGN. 


1. What is the highest bit rate that can be transmitted without ISI? 


2. Determine the required $,/No to achieve a bit error of P) = 1077 for each 
repeater. 


3. Determine the transmitted power at each repeater to achieve the desired @;/No, 
where No = 4.1 x 1072! W/Hz. 


10.37 (Carrierless QAM or PSK Modem) Consider the transmission of a QAM or M-ary 
PSK (M > 4) signal at a carrier frequency f,, where the carrier is comparable to 
the bandwidth of the baseband signal. The bandpass signal may be represented as 


s(t) =Re £ ang(t — ane] 


n 














1. Show that s(t) can be expressed as 


s(t) = Re [Esee = | ; 


n 


where Q(t) is defined as 
Ot) =q) + JG), 
q(t) = g(t) cos 27t fet, 
q(t) = g(t) sin 27t fot 
and a’ is a phase-rotated symbol, i.e.,a/, = ape ™ Sent, 
2. Using filters with responses q(t) and q(t), sketch the block diagram of the mod- 


ulator and demodulator implementation that does not require the mixer to trans- 
late the signal to bandpass at the modulator and to baseband at the demodulator. 
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10.38 [Carrierless amplitude or phase (CAP) modulation] In some practical applications in 
wireline data transmission, the bandwidth of the signal to be transmitted is compa- 
rable to the carrier frequency. In such systems, it is possible to eliminate the step of 
mixing the baseband signal with the carrier component. Instead, the bandpass signal 
can be directly synthesized by embedding the carrier component in the realization 
of the shaping filter. Thus, the modem is realized, as shown in the block diagram in 
Figure P-10.38, where the shaping filters have the impulse responses 


q(t) = g(t) cos 27 fet, 
Q(t) = g(t) sin 27 fet 


and g(t) is a pulse that has a square-root raised cosine spectral characteristic. 


1. Show that 
foe) 
f q(t)g(t) dt =0 


—00 


and that the system can be used to transmit two-dimensional signals, e.g., PSK 
and QAM. 


2. Under what conditions is this CAP modem identical to the carrierless QAM/PSK 
modem treated in Problem 10.37? 





Shaping 





. >| fiter 
Serial + 
Input | to q(t) l 
data “| parallel E To transmitter 
converter apıng = 
i———|_ filter 
â (e) 





(a) Modulator 





Filter or 
correlator |= ~>] Detector 








Received q@ and Output 
er —— rs parallel ant 
se Filter or to serial ata 
-—>| correlator =y ——|_ converter 
CLO) 
Sample 
att= kT 
(b) Demodulator 


Figure P-10.38 
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COMPUTER PROBLEMS 


10.1 Linear Filter Model of a Communication Channel 


As indicated in this chapter, a bandlimited communication channel can be mod- 
eled as a linear filter whose frequency response characteristics match the frequency 
response characteristics of the channel. Therefore, we may design digital finite- 
duration impulse response (FIR) or infinite-duration impulse response (IIR) filters 
that approximate the frequency-response characteristics of analog communication 
channels. 


Suppose that we wish to model an ideal channel that has an amplitude response 
A(f) = 1 for |f] < 2000 Hz and A(f) = 0 for |f| > 2000 Hz, and a constant 
delay (linear phase) for |f] < 2000 Hz. The sampling rate for the digital filter is 
selected as F; = 10,000 Hz. Since the desired phase response is linear, only an 
FIR filter could satisfy this condition. However, it is not possible to achieve a zero 
response in the stopband. Instead, we select the stopband response to be —40 dB and 
the stopband frequency to be 2500 Hz. In addition, we allow for a small amount, 
0.5 dB, of ripple in the passband. 


Design an FIR filter with these characteristics. Plot the frequency and phase response 
of the filter in the frequency band0 < f < 5000 Hz. Note that 5000 Hz is the folding 
frequency. j 


10.2 Effect of Intersymbol Interference 


The purpose of this problem is to view the effect of intersymbol interference (ISI) 
on the received signal sequence {y,,} for two channels that are characterized by the 
discrete time responses {xn} as follows: 


Channel 1 
1, n=0 
eas —0.25, n = +1 
™™ į 0.1, n= +2 
0, otherwise 
and 
Channel 2 
1, n=0 
mE 0.5, n=l 
™~) —0.2, n= +2 
0, otherwise 


Note that in these channels, the ISI is limited to two symbols on either side of the 
desired transmitted signal. Hence, the cascade of the transmitter and receiver filters 
and the channel at the sampling instants are represented by the equivalent discrete- 
time FIR channel filter shown in Figure CP-9.3. 
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Suppose that the transmitted signal sequence is binary (+1), and let the received 
sequence at the output of the equivalent discrete-time FIR channel filter be denoted 
as {yx}. Therefore, the output sequence {yg} can be expressed as 


Yk = Ak + X1ak—1 + X—1đk+1 + X24k—2 + X—24k+2, K=1,2,..., 


where {ap = +1} is the input data sequence. Assuming that a, = 1, compute and 
plot the values of {y,} for the 16 possible data sequences {ay_1, Ak+1, Ak—-2, Ak+2} 
for channel 1 and channel 2. Use separate plots for the two channels. Repeat the 
experiment when a, = —1. Which channel characteristic results in detector errors 
due to ISI even in the absence of noise? 


Repeat the preceding experiments when the channel output sequence is corrupted by 
additive zero-mean white Gaussian noise with the variance c? = 0.1. Compare these 
results with the noiseless output and comment on the effect of the additive noise on 
the two channel output sequence. 





—0.25 —0.25 





(b) 


Figure CP-10.3 FIR channel models with ISI: (a) channel 1; (b) channel 2. 


10.3 Design of Optimum Transmitter and Receiver Filters 
The objective of this problem is to design a digital implementation of the transmitter 
and receiver filters, Gr (f) and Gr(f), such that 
IGr(f)IIGa(f)| = Xre(f) 


and Gr(f) is the matched filter to Gr (f). 


The simplest way to design and implement the transmitter and receiver filters in 
digital form is to employ FIR filters with linear phase (symmetric impulse response). 
If X;c(f) is the desired raised cosine frequency response with a specified roll-off 
parameter, the magnitude response of the transmitter and receiver filters is 
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10.4 
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IGr(f)| = IGRI = V Xre(f).- 


The frequency response of the filters is related to the impulse response by the Fourier 


transform relation 
(N-1)/2 


Gr(fy= >) grm) en, (A) 


n=—(N—1)/2 


where T's is the sampling interval and N is the length of the filter. Note that N is odd. 
Since Gr(f) is bandlimited, we may select the sampling frequency Fs to be at least 
2/T. In particular, let 


or equivalently, Ts = T/4. Hence, the folding frequency is F;/2 = 2/T. Since 


IGr(f)| = /Xre(f), we may sample X,,(f) at equally spaced points in frequency, 
with the frequency separation Af = F;/N. Thus, we have 


F (N—1)/2 
Vy Xr(mAf) = | Xr (=) = 5 gr(n) gry, (B 


N n=—(N—1)/2 


The inverse transform relation is used to obtain the impulse response, i.e., 


N2 e aN 
Am NN N-1 
pos YT Xe are ean, n=0.21,...¢("S), C 
m=—(N—1)/2 NT 2 


Since gr(n) is symmetric, the impulse response of the desired linear phase trans- 
mitter filter is obtained by delaying gr(n) by (N — 1)/2 samples, i.e., we have 
8T (n — %4), n=0,1,...,N-—1. 


1. Determine and plot the impulse response gr [n] of the transmitter filter of lengtt 
N = 31 when X,c(f) has a roll-off factor a = 1/4, and 1/T = 1800 Hz. 


2. Determine and plot the frequency response |Gr(f)| forO < f < 05 fron 
Equation (A). Does Gr(f) = 0 for |f| > (1 + @)/T? Explain why it does o1 
does not. 


3. Plot |Gr(f)|* and X(f) on the same graph and compare the two graphs 
Explain the difference. 


4. Repeat Steps 1, 2, and 3 fora = 1/4, 1/T = 1800 Hz, and N = 41. Compart 
and comment in the results. 


Design of a Duobinary Signal Filter 


The objective of this problem is to design a digital implementation of the transmitte 
and receiver filters Gr (f) and Gr(f) such that their product is equal to the spectrun 
of a duobinary pulse and Gr (f) is the matched filter to Gr (f). 
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To satisfy the frequency domain specification, we have 


1 (55) 
— cos (2>), IfI <W 
Ier(niierpi=i” Vw l 


0, Ifl >W 
Therefore, 
leo (34), IfI <W 
Ierpl={ VW W ; 
0, IfI > W 


By following the same approach as in Computer Problem 10.4, we obtain the impulse 
responses for a linear-phase FIR implementation of the transmitter and receiver fil- 
ters. Therefore, with W = 1/2T and F, = 4/T, we have 


peg 4m N-1 
grin] = 5 or(=) giznmni N n= 0,1... (2) 
n=—(N—1)/2 2 








and grin] = grin]. 


1. Determine and plot gy [n — 4*] for W = 1800 and N = 31. 
2. By using Equation (A) of Computer Problem 9.4, determine and plot |G7(f)|. 


3. Plot |Gr(f)|? and the ideal duobinary (cosine) spectrum on the same graph and 
compare the two spectra. Explain the difference. 


10.5 Precoding for Duobinary Signals 


Write a MATLAB program that takes a binary data sequence {d}, precodes it for 
a duobinary pulse transmission system to produce the sequence {p}, and maps the 
precoded sequence into the transmitted amplitude levels {az}. Then, from the trans- 
mitted sequence {az}, and from the received noise-free sequences {bg}, recover the 
original data sequence {d,}. Verify the operation of your program using the data 
sequence {100101110110}. 


10.6 Simulation of Detector Performance for Duobinary Signal 


The objective of this problem is to simulate a binary PAM communication system 
that employs a duobinary signal pulse, where the precoding and amplitude conver- 
sion that yields the sequence {a,,} are performed as prescribed in the text. Hence, the 
input to the detector is the sequence 


Ye = bk + Nk 
= Ap +ak-ı tn, k+1,2, 
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where the sequence {ng} is zero mean, Gaussian, and uncorrelated. The variance of 
Nk iso a 

Perform the simulation for 10,000 bits and measure the bit-error probability for 
a? = 0.1,07 = 0.5, and ø? = 1. Plot the theoretical error probability for binary 
PAM with no ISI, and compare the simulation results with this ideal performance, 
You should observe some degradation in the performance of the duobinary system, 


What are the approximate values of the degradation for o? = 0.1,07 = 0.5, and 
o? =1? 


Zero-Forcing Equalizer 


The objective of this problem is to design a zero-forcing equalizer for the channel- 
distorted pulse x(t) at the input to the equalizer given by the expression 


1 
1+ 
where 1/T is the symbol rate. The pulse is sampled at the rate 2/T, which is the 
input rate to the equalizer. 


x(t) = 


As described in the text, the zero-forcing equalizer must satisfy the equations 


us nT 1,m=0 
q(mT) = D eux (mr —) = |) m = l, 2, ..., £K’ 


n=—K 


where 2K + 1 is the number of taps in the equalizer and {c,} are the equalizer 
coefficients. 


Write a program that computes the equalizer coefficients for any value of K. Com- 
pute the equalizer coefficients for K = 2,4, and 6. Plot the input sequence to the 
equalizer for the pulse x(t), and plot the equalizer output samples foreach K = 2, 4, 
and 6. Compare how well the equalizer performs for each value of K by comparing 
the residual ISI at the output of the equalizer. 


MSE Equalizer 


The objective of this problem is to design an equalizer, based on the mean-square- 
error criterion, for the channel-distorted pulse described by the expression 


1 


*O= Tyre’ 


where 1/T is the symbol rate. The pulse is sampled at the rate 2/7, which is the 
input rate to the equalizer. The information symbols have zero mean, unit variance, 
and are uncorrelated, i.e., 


E(an) = 0; 
E(\an\?) = 1; 
E(anam) =0, nAm. 
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The additive noise samples at the input to the equalizer are zero-mean uncorrelated 
Gaussian with variance o? = 0.01 and o? = 0.1. 


Write a program that computes the coefficients of the equalizer having 2 K + 1 taps. 
Compute the equalizer coefficients for K = 2, 4, and 6. Plot the sequence to the 
equalizer for the pulse x(t), and plot the equalizer output samples for each K = 2, 
4, and 6. Compare how well the equalizer performs for each value of K by computing 
the residual ISI at the output of the equalizer. 





Output 
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Figure CP-10.9 Linear adaptive equalizer based on the MSE criterion. 





10.9 Simulation of an Adaptive Equalizer 


The objective of this problem is to implement an adaptive equalizer based on the 
LMS algorithm. The channel is modeled as an FIR filter with symbol-spaced values 
that are given as follows: 


x = [0.05, —0.063, 0.088, —0.126, —0.25, 0.9047, 0.25, 0.126, 0.038, 0.088]. 


The MSE equalizer is also an FIR filter with symbol-spaced tap coefficients. Training 
symbols are transmitted initially to train the equalizer. In the data mode, the equal- 
izer employs the output of the detector in forming the error signal used in the LMS 
algorithm. The block diagram of the system is shown in Figure CP-10.9. 
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Write a program that performs the simulation of the system in Figure CP-10.9. Use 
1000 training (binary) symbols and 10,000 binary data symbols for the FIR channel 
model previously given. Use ø? = 0.01, o? = 0.1, and ø? = 1 for the variance of 
the additive, zero-mean Gaussian noise sequence. Compare the measured error rate 
with that of an ideal channel with no ISI. 
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In Section 10.5, we considered digital transmission through nonideal channels, and we 
observed that such channels cause intersymbol interference when the reciprocal of the sym- 
bol rate is significantly smaller than the time dispersion (duration of the impulse response) 
of the nonideal channel. In such a case, the receiver employs a channel equalizer to com- 
pensate for the channel distortion. If the channel is a bandpass channel with a specified 
bandwidth, the information-bearing signal may be generated at the baseband and then 
translated in frequency to the passband of the channel. Thus, the information-bearing signal 
is wansmitted on a single carrier. We also observed that intersymbol interference usually 
results in some performance degradation, even in the case where the optimum detector is 
used to recover the information symbols at the receiver. 


11.1 ORTHOGONAL FREQUENCY-DIVISION MULTIPLEXING 


We can use an alternative approach to the design of a bandwidth-efficient communication 
system in the presence of channel distortion. Here, we subdivide the available channel 
bandwidth into a number of equal-bandwidth subchannels, where the bandwidth of each 
subchannel is sufficiently narrow so that the frequency-response characteristics of the sub- 
channels are nearly ideal. Such a subdivision is illustrated in Figure 11.1. Thus, we create 
K = W/Af subchannels, where different information symbols can be transmitted simul- 
taneously in K subchannels. Consequently, the data is transmitted by frequency-division 
multiplexing (FDM). 
With each subchannel, we associate a carrier 


x(t) =cos2un fkt, k=0,1,...,K —1, (11.1.1) 


where fk is the mid-frequency in the kth subchannel. By selecting the symbol rate 1/T on 
each of the subchannels to be equal to the separation Af of the adjacent subcarriers, the 
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| pra bee ee mn a, 


ICCP) rr 
1 Figure 11.1 Subdivision of the channel 
0 =e w f bandwidth W into narrowband 
Af subchannels of equal width Af. 


subcarriers are orthogonal over the symbol interval T, independently of the relative phase 
relationship between subcarriers. That is, 


T 
f cos(27 fkt + $k) cos(2n fjt + ġ;)dt = 0, (11.1.2) 
0 


where f, — fj =n/T,n = 1, 2,..., independently of the values of the phases ġx and ¢ je 
In this case, we have orthogonal frequency-division multiplexing (OFDM). 

Multicarrier modulation (OFDM) has been used in both wireline and radio channels. 
In particular, it is used in digital subscriber loop (DSL) modems that are used to pro- 
vide high-speed internet access to homes and businesses. OFDM is also used in wireless 
local area networks (LANs) that are used in homes and offices for wireless access to the 
internet. 











11.2 MODULATION AND DEMODULATION IN AN OFDM SYSTEM 


In an OFDM system with K subchannels, the subcarrier frequencies are {cos2n fyt, 
0 < k < K — 1}, where adjacent subcarrier frequencies are separated by Af = 1/T, 
i.e., fear — fk = Af = 1/T and T is the symbol interval. The symbol rate 1/T is 
reduced by a factor of K relative to the symbol rate on a single-carrier system that employs 
the entire bandwidth W and transmits data at the same rate as OFDM. Hence, the sym- 
bol interval in the OFDM system is T = KT;, where Ts is the symbol interval in the 
single-carrier system. By selecting K to be sufficiently large, the symbol interval T can 
be made significantly larger than the time duration of the channel-time dispersion. 
Thus, intersymbol interference can be made arbitrarily small through the selection 
of K. Inother words, each subchannel appears to have a fixed frequency response 
C( fk), kK =0,1,...,K —1. 

Suppose that each subcarrier is modulated with M-ary quadrature amplitude modula- 
tion (QAM). Then, the signal on the kth subcarrier may be expressed as (where 
gr(t) = /2/T,0 <t <T) 
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Figure 11.2 The graph of wg (t) in Example 11.2.1. 


/2 /2 
u(t) = Tp Ate cos 27 fkt — pats sin 27 fkt 
2 jO, „j2n fkt 
= Re| ,/ = Aget% ejr fi 
T 


Denar 
= re fieran |, (11.2.1) 


where X;, = Axe/® is the signal point from the QAM signal constellation that is transmit- 


ted on the kth subcarrier, A, = ,/ AZ, + Ag,, and 6 = tan! (Axs/Axc). The energy per 
symbol é, has been absorbed into {X;}. 








Example 11.2.1 


The signal point X, = 3 + j1 selected from an M = 8 QAM signal constellation is to be 
transmitted on the frequency fe = 6/T, where T = 50 seconds is the symbol duration. Using 
Equation (11.2.1), compute and plot the waveform u¢(t). 


Solution Figure 11.2 shows the graph of u6(t) a 


When the number of subchannels is large, so that the subchannels are sufficiently 
narrowband, each subchannel can be characterized by a fixed frequency response C( fx), 
k =0,1,..., K — 1. In general, C( fk) is complex valued and may be expressed as 


C( fi) = Ck = |Cxle/*. (11.2.2) 
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Hence, the received signal on the kth subchannel is 


2 2 
r(t) = eT cos (2x fkt + Oy) — T i sin (2x fkt + pk) + ng(t) 


2 . 
= report] +n,(t), (11.2.3) 


where ng(t) represents the additive noise in the kth subchannel. We assume that ng(t) is 
zero-mean Gaussian and spectrally flat across the bandwidth of the kth subchannel. We also 
assume that the channel parameters |C;| and ¢, are known at the receiver. (These para- 
meters are usually estimated by initially transmitting the unmodulated carrier cos 27 fyt 
and observing the received signal |C;| cos(21fxt + dx.) 


Example 11.2.2 


Consider the signal generated in Example 11.2.1. Suppose the channel gain and phase shift at 
frequency fg is 


ae 
C(fe) = sei” 


or |C(fe)| = 1/2 and ġs = 1/2. Compute and plot the received waveform r¢(t) in the absence 
of noise. 


Solution The graph for re(t) is shown in Figure 11.3. | 


The demodulation of the received signal in the kth subchannel may be accomplished 
by cross correlating r(t) with the two basis functions, based on knowledge of the carrier 
phase {@,} at the receiver, 
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Figure 11.3 The graph of r¢(t) in Example 11.2.2. 
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y(t) = [200s (2nfit +b), O<t<T, 


2 
wn) = my sin (2x fat +o), O<t<T, (11.2.4) 


and sampling the output of the cross correlators at t = T. Thus, we obtain the received 
signal vector 


Ye = (|Ck| Ake + kr, |Crl]Aks + Nki), (11.2.5) 
which can also be expressed as the complex number 
Y; = |Ck|Xk + nk, (11.2.6) 


where nk = Nkr + jnki represents the additive noise. 
The scaling of the transmitted symbol by the channel gain |C;| can be removed by 
dividing Yx by |C,|. Thus, we obtain 


Y; = Yi /|Cel = Xk +m, (11.2.7) 


where n, = 7x/|C |. The normalized variable Y; is passed to the detector, which computes 
the distance metrics between Y; and each of the possible signal points in the QAM signal 
constellation and selects the signal point resulting in the smallest distance. 

From this description, it is clear that two cross correlators or two matched filters are 
required to demodulate the received signal in each subchannel. Therefore, if the OFDM 
signal consists of K subchannels, the implementation of the OFDM demodulator requires 
a parallel bank of 2K cross correlators or 2K matched filters. Furthermore, the modulation 
process for generating the OFDM signal can also be viewed as exciting a bank of 2K 
parallel filters with symbols taken from an M-ary QAM signal constellation. 

The bank of 2K parallel filters that generates the modulated signal at the transmitter 
and demodulates the received signal is equivalent to the computation of the discrete Fourier 
transform (DFT) and its inverse. Since an efficient computation of the DFT is the fast 
Fourier transform (FFT) algorithm, a more efficient implementation of the modulation and 
demodulation processes when K is large, e.g., K > 20, is by means of the FFT algorithm. 
In the next section, we describe the implementation of the modulator and demodulator in 
an OFDM system that uses the FFT algorithm to compute the DFT. 

Since the signals transmitted on the K subchannels of the OFDM system are syn- 
chronized, the received signals on any pair of subchannels are orthogonal over the interval 
O < t < T. If the subchannel gains |C;,|,0 < k < K — 1 are sufficiently different across 
the channel bandwidth, subchannels that yield a higher SNR due to a lower attenuation 
can be modulated to carry more bits per symbol than subcarriers that yield a lower SNR 
(high attenuation). Consequently, QAM with different constellation sizes can be used on 
the different subchannels of an OFDM system. This assignment of different constellation 
sizes to different subchannels is generally done in practice, e.g., in DSL modems. 
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11.3 AN OFDM SYSTEM IMPLEMENTED VIA THE FFT ALGORITHM 


In this section, we describe an OFDM system that uses QAM for data transmission on 
each of the subcarriers and the FFT algorithm in the implementation of the modulator and 
demodulator. 

The basic block diagram of the OFDM system is illustrated in Figure 11.4. A serial- 
to-parallel buffer subdivides the information sequence into frames of By bits. The B f 
bits in each frame are parsed into K groups, where the ith group is assigned b; bits. 
Hence, 











> bra Bp (11.3.1) 


We may view the multicarrier modulator as generating K independent QAM 
subchannels, where the symbol rate for each subchannel is 1/7 and the signal in each 
subchannel has a distinct QAM constellation. Hence, the number of signal points for the 
ith subchannel is M; = 2°. We will denote the complex-valued signal points correspond- 
ing to the information signals on the K subchannels by X;,k = 0,1,..., K — 1. These 
information symbols {X+} represent the values of the DFT of a multicarrier OFDM signal 
x(t), where the modulation on each subcarrier is QAM. Since x(t) must be a real-valued 
signal, its N-point DFT {X;} must satisfy the symmetry property Xn- = X;. Therefore, 
we create N = 2K symbols from K information symbols by defining 




















Xn- = Xi, k=1,2,...,K —1, (11.3.2) 
Xo = Re[Xol, (11.3.3) 
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Figure 11.4 Block diagram of a multicarrier OFDM digital communication system. 

















i eat as as ait Oc lh 











Section 11.3 An OFDM System Implemented via the FFT Algorithm 627 
Note that the information symbol Xo is split into two parts, both of which are real. If we 


denote the new sequence of symbols as {X,,k = 0,1,...,N — 1}, the N-point inverse 
DFT (IDFT) yields the real-valued sequence 


=e 
Xn = yx el2nnk/N 
N k=0 





K-1 
1 27nk 
= ——=]Re[Xo]+(-1)” Im[Xo] +2 [Xul 00s( +a) , n=0,1,...,N—1, 
JN | 2 N 
(11.3.5) 
where the information symbol X; = | Xx jek, 


The sequence {xn,0 < n < N — 1} corresponds to samples of the multicarrier 
OFDM signal x(t) which consists of K subcarriers f, = k/T, k = 0,1,2,...,K — 1, 
where T is the signal duration. We note that the information symbol Xo corresponds to the 
DC component (fo = 0). For convenience, let us set Xo = 0, so that the OFDM signal has 
no DC component. Then the transmitted OFDM signal, which is generated by passing the 
signal samples {xn} through a digital-to-analog (D/A) converter, may be represented as 


2nkt 
i +61), O<1<T (11.3.6) 





2 Ko 
x(t) = JN D |X;,| cos ( 
k=1 


Example 11.3.1 
Using the M = 16 QAM rectangular signal constellation shown in Figure 8.54, select pseu- 
dorandomly each of the information symbols X1, X2, X3, X4. With T = 100 sec, generate the 
signal waveform x(t) given by Equation (11.3.6) and plot x(t). Also, compute the IDFT val- 
ues of {x,}, forO < n < 9 given by Equation (11.3.5) and demonstrate that x, = x (nT/N) = 
x(10n), O<n <9. 


Solution In this example K = 5 and N = 2K = 10. The graph for x(t) is shown in 


Figure 11.5. The values of the IDFT are x = 0,x; = —2.1151,x. = —0.7265,x3 = 
—8.0403, x4 = —3.0777, x5 = 8, x6 = 3.0777, x7 = —0.4318, xg = 0.7265, x9 = 2.5872. It 
is easily verified that x(nT/N) = xn. r 


With x(t) as the input to the channel, the channel output at the receiver may be 
expressed as 
r(t) = x(t) *c(t) +n), (11.3.7) 


where c(t) is the impulse response of the channel and * denotes convolution. Since the 
bandwidth Af of each subchannel is selected to be very small relative to the overall chan- 
nel bandwidth W = K Af, the symbol duration T = 1/A/ is larger than the duration of 
the channel impulse response. To be specific, suppose that the channel impulse response 
spans m +1 signal samples, wherem < N.A simple way to completely avoid intersymbol 
interference (ISI) is to insert a time guard of duration mT/N between the transmission of 
successive data blocks. This allows the response of the channel to die out before the next 
block of K symbols is transmitted. 

An alternative method of avoiding ISI is to append a so-called cyclic prefix to each 
block of N signal samples {x,,0 < n < N — 1}. The cyclic prefix for the block of 
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samples contains the samples Xy—m,XN-m+1,---,Xy—1. These samples are appended to 
the beginning of the block, thus creating a signal sequence of length N + m samples, 
which may be indexed from n = —m ton = N — 1, where the first m samples constitute 


the cyclic prefix. Then, if the sample values of the channel response are {c,,0 < n < m}, 
the convolution of {c,} with {x,,—m < n < N — 1} produces the received signal {r,}. 
Since the ISI in any pair of successive signal transmission blocks affects the first m signal 
samples, we discard the first m samples of {r„} and demodulate the signal based on the 
received signal samples {r,,0 <n < N — 1}. 
Example 11.3.2 

Consider the signal generated in Example 11.3.1. Suppose that the channel impulse response 


c(t) spans the time interval 0 < t < 15 sec. Determine the cyclic prefix to be appended to the 
IDFT sequence {xn}. 


Solution With T = 100 sec, the channel span is smaller than the time duration between 
the two IDFT samples. Thus, we select m = 2, and the cyclic prefix consists of the samples 


{xx-2, Xw-i} = {x8, Xo}. r 


If we view the channel characteristics in the frequency domain, the channel fre- 
quency response at the subcarrier frequencies fg = k/T is 


m 


2mk 
oe (Fr) g de eri, k= 01,...,N—1, 11.3.8 
Since the ISI is eliminated through either the cyclic prefix or the time guard band, the 
demodulated sequence of symbols may be expressed as 
R= CX +m, k=0,1,...,N—1, (11.3.9 


where {X;} is the output of the N-point DFT computed by the demodulator and {ng} is the 
additive noise corrupting the signal. 
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As illustrated in Figure 11.4, the received signal is demodulated by computing the 
DFT of the received signal after it has passed through an analog-to-digital (A/D) con- 
verter. As in the case of the OFDM modulator, the DFT computation at the demodulator is 
performed efficiently with the use of the FFT algorithm. 

To recover the information symbols from the values of the computed DFT, it is nec- 
essary to estimate and compensate for the channel factors {C}. The channel measurement 
can be accomplished by initially transmitting either a known modulated sequence on each 
of the subcarriers or simply by transmitting the unmodulated subcarriers. If the channel 
characteristics vary slowly with time, the time variations can be tracked by using the deci- 
sions at the output of the detector in a decision-directed manner. Thus, the multicarrier 
OFDM system can be made to operate adaptively. The transmission rate on each subcarrier 
can be optimized by properly allocating the average transmitted power and the number of 
bits that are transmitted by each subcarrier. The SNR per subchannel may be defined as 


2 
SNR; = — 3; (11.3.10) 


where T is the symbol duration, Pą is the average transmitted power allocated to the 
kth subchannel, |C;|* is the squared magnitude of the frequency response of the kth sub- 
channel, and oh is the corresponding noise variance. In subchannels with high SNR, we 
transmit more bits/symbol by using a larger QAM constellation than we would use with 
subchannels with low SNR. Thus, the bit rate on each subchannel can be optimized so 
that the error-rate performance among the subchannels is equalized to satisfy the desired 
specifications. 

Multicarrier OFDM using QAM modulation on each of these subcarriers has been 
implemented for a variety of applications, including high-speed transmission over tele- 
phone lines, such as digital subscriber lines (DSL). This type of multicarrier OFDM mod- 
ulation has also been called discrete-multitone (DMT) modulation. Multicarrier OFDM is 
also used in digital audio broadcasting in Europe and other parts of the world, as well as in 
wireless LANs. 


11.4 SPECTRAL CHARACTERISTICS OF OFDM SIGNALS 


Although the signals transmitted on the subcarriers of an OFDM system are mutually 
orthogonal in the time domain, i.e., 


T 
i up(t)uj(t)dt =0 kj, (11.4.1) 
0 





where u(t) is defined in Equation (11.2.1), these signals have significant overlap in the 
frequency domain. This can be observed by computing the Fourier transform of the signal 


: 2 
a u;(t) = e| Zea 


= |Z Arcos Omsft +00), O<t<T (11.4.2) 
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Figure 11.6 An example of the magnitude of the frequency response of adjacent subchannel filters in OFDM 
system for f € (0, 0.06%) and K = 64. (From Cherubini et al. (2002) IEEE.) 


for several values of k. Figure 11.6 illustrates the magnitude spectrum |U;(f)| for three 
adjacent subcarriers. Note the large spectral overlap of the main lobes. Also note that the 
first sidelobe in the spectrum is only 13 dB down from the main lobe. Hence, there is a sig- 
nificant amount of spectral overlap among the signals wansmitted on different subcarriers. 
Nevertheless, these signals are orthogonal when transmitted synchronously in time. 

The large spectral overlap of the OFDM signals has various ramifications when the 
communication channel is a fading channel. In Chapter 14, we will see that signal fading, 
which is caused by time-varying multipath propagation, results in Doppler spreading of 
the transmitted signal. The multipath propagation of the signal components in the OFDM 
signal destroys the orthogonality among the subcarriers; when combined with the Doppler 
spreading, it results in intersubchannel interference (ICI). This ICI produces a significant 
degradation in the performance (error probability) of the OFDM system. Consequently, 
OFDM may not be as robust as a single carrier system in radio communications where 
the receiving terminal is moving at high speed. On the other hand, ICI is not a serious 
problem in OFDM systems in which the receiving terminal is moving at a low speed, e.g., 
pedestrian speed. This is the case, for example, in wireless LANs that employ OFDM 
signals with large (M = 64) QAM signal constellations. 

Another type of multicarrier modulation is much more robust than OFDM in the 
presence of ICI resulting from a large Doppler spread due to high terminal speeds; this is 
called filtered multitone (FMT) modulation. An FMT signal is also generated and demod- 
ulated through a bank of parallel filters. However, the filters in FMT are designed to 
have negligible spectral overlap and extremely sharp frequency roll-off characteristics. 
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Figure 11.7 An example of the magnitude of the frequency response of adjacent subchannel filters in an FMT 
system for f € (0, 0.06%) and design parameters K = 64. (From Cherubini et al. (2002) IEEE.) 


For example, Figure 11.7 illustrates the frequency-response characteristics in an FMT sys- 
tem. Note that the filter sidelobes are at least 70 dB below the main lobe and the spectral 
overlap between adjacent filters is negligible. Such filter characteristics provide signifi- 
cant immunity against ICI that may be encountered in highly mobile radio communication 
environments. Another advantage of FMT is that the signals transmitted on the different 
subcarriers need not be synchronous. 

The large immunity against ICI provided by FMT comes at a price in bandwidth 
efficiency. In papers that describe the design of FMT systems, Cherubini et al. (2000, 2002) 
demonstrate that the increase in bandwidth compared with a conventional OFDM system 
can be made relatively modest, of the order of 10% to 20%. Therefore, FMT provides an 
alternative to conventional OFDM in high-mobility applications. 


11.5 PEAK-TO-AVERAGE POWER RATIO IN OFDM SYSTEMS 


A major problem with multicarrier modulation in general and OFDM systems in particular 
is the high peak-to-average power ratio (PAR) that is inherent in the transmitted signal. 
Large signal peaks occur in the transmitted signal when the signals in the K subchannels 
add constructively in phase. Such large signal peaks may saturate the power amplifier at the 
transmitter, thus, causing intermodulation distortion in the transmitted signal. Intermodu- 
lation distortion can be reduced and often avoided by reducing the power in the transmitted 
signal and, thus, operating the power amplifier at the transmitter in the linear range. Such a 
power reduction or “power back-off” results in inefficient operation of the OFDM system. 
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For example, if the PAR is 10 dB, the power back-off may be as much as 10 dB to avoid 
intermodulation distortion. 


Example 11.5.1 
Generate samples of the OFDM signal 





where K = 32, T = 1 sec, the sample rate F, = 200 samples per second, and the modula- 
tion on each subcarrier is four-phase PSK; i.e., 0, takes the possible values 0, 1/2, x, 31/2, 
selected pseudorandomly. For each realization of x(t), determine the PAR. Repeat the com- 
putation of the PAR for 20 different realizations of x(t) and plot the values of the PAR for the 
20 different realizations. 


Solution The average power of the sample {xn} is 


n=0 
and the peak power is 
Poeak = max{x,}. 
Hence, the PAR = Preax/ Pay. The plot of the PAR is shown in Figure 11.8. | 


Various methods have been devised to reduce the PAR in multicarrier systems. One 
of the simplest methods is to insert different phase shifts in each of the subcarriers. These 
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Figure 11.8 PAR values in Example 11.5.1. 
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phase shifts can be selected pseudorandomly, or by means of some algorithm, to reduce 
the PAR. For example, we may have a small set of stored pseudorandomly selected phase 
shifts that can be used when the PAR in the modulated subcarriers is large. The information 
on which set of pseudorandom phase shifts is used in any signal interval can be transmitted 
to the receiver on one of the K subcarriers. Alternatively, a single set of pseudorandom 
phase shifts may be employed, where this set is found via computer simulation to reduce 
the PAR to an acceptable level over the ensemble of possible transmitted data symbols on 
the K subcarriers. 

Another method that can reduce the PAR is to modulate a small subset of the sub- 
carriers with dummy symbols, which are selected to reduce the PAR. Since the dummy 
symbols do not have to be constrained to take amplitude and phase values from a speci- 
fied signal constellation, their design is very flexible. The subcarriers carrying the dummy 
symbols may be distributed across the frequency band. Since modulating subcarriers in 
this manner results in a lower throughput in data rate, we want to employ only a small 
percentage of the total subcarriers. 

Alternatively, the PAR may be kept within a specified limit by clipping the signal at 
the D/A converter. The clipping generally distorts the signal at the transmitter and, hence, 
degrades the performance at the receiver. 

Because of its practical importance, the problem of PAR reduction in multicarrier 
communication systems has been thoroughly investigated and other methods have been 
devised. The interested reader may refer to the literature cited in Section 11.7. 


11.6 APPLICATIONS OF OFDM 


OFDM is used in a variety of digital communication systems, including digital audio 
broadcasting (DAB), digital video broadcasting (DVB), high-speed transmission over tele- 
phone lines such as digital subscriber lines (DSL), and wireless local area networks (LANs). 
In this section, we briefly describe three of these applications. 


11.6.1 Digital Subscriber Lines 


As a first application of OFDM, consider high-speed digital transmission over wirelines 
that connect a telephone subscriber’s premises to a telephone central office. These wireline 
channels typically consist of unshielded twisted-pair wire and are commonly called the 
subscriber local loop. The desire to provide high-speed internet access to homes and busi- 
nesses over the telephone subscriber loop has resulted in the development of a standard for 
digital transmission based on OFDM with QAM as the basic modulation method on each 
of the subcarriers. 

The usable bandwidth of a twisted-pair subscriber loop wire is primarily limited by 
the distance between the subscriber and the central telephone office, i.e., the length of the 
wire, and by crosstalk interference from other lines in the same cable. For example, a 3 km 
twisted-pair wireline may have a usable bandwidth of approximately 1.2 MHz. Since the 
need for high-speed digital transmission is usually in the direction from the central office 
to the subscriber (the downlink) and the bandwidth is relatively small, the major part of 
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the bandwidth is allocated to the downlink. Consequently, the digital transmission on the 
subscriber loop is asymmetric, and this transmission mode is called ADSL (asymmetric 
digital subscriber line). 

In the ADSL standard, the downlink and the uplink maximum data rates are speci- 
fied as 6.8 Mbps and 640 kbps, respectively, for subscriber lines of approximately 12,000 
feet in length, and 1.544 Mbps and 176 kbps, respectively, for subscriber lines of approxi- 
mately 18,000 feet in length. The low part of the frequency band (0-25 kHz) is reserved for 
telephone voice transmission, which requires a nominal bandwidth of 4 kHz. Hence, the 
frequency band of the subscriber line is separated into two frequency bands via two filters 
(lowpass and highpass) that have cutoff frequencies of 25 kHz. Thus, the low end frequency 
for digital transmission is 25 kHz. The ADSL standard specifies that the frequency range of 
25 kHz to 1.1 MHz must be subdivided into 256 parallel OFDM subchannels. Hence, the 
size of the DFT and IDFT in the system implementation shown in Figure 11.4 is N = 512. 
A sampling rate F; = 2.208 MHz is specified, so that the high-end frequency in the signal 
spectrum is F;/2 = 1.104 MHz. The frequency spacing between two adjacent subcarriers 
is Af = 1.104 x 106/256 = 4.3125 kHz. The channel time dispersion is suppressed by 
using a cyclic prefix of N/16 = 32 samples. 

By measuring the signal-to-noise ratio (SNR) for each subchannel at the receiver 
and communicating this information to the transmitter via the uplink, the transmitter can 
select the QAM constellation size in bits/symbol to achieve a desired error probability 
in each subchannel. The ADSL standard specifies a minimum bit load of 2 bits per sub- 
channel, which corresponds to QPSK modulation. If a subchannel cannot support QPSK 
at the desired error probability, no information is transmitted over that subchannel. As an 
example, Figure 11.9 illustrates the received SNR as measured by the receiver for each 
subchannel and the corresponding number of bits/symbol selected from a QAM signal 
constellation. Note that the SNR in subchannels 220-256 is too low to support QPSK 
modulation; hence, no data are transmitted on these subchannels. ADSL channel char- 
acteristics and the design of OFDM modems based on the ADSL standard are treated 
in detail in the books by Bingham (2000) and Starr et al. (1999). The use of OFDM 
with variable size QAM signal constellations for each of the subcarriers is often called 
discrete multitone (DMT) modulation. 
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Figure 11.9 Example of a DSL frequency response and bit allocation on the OFDM subchannels. 
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11.6.2 Wireless LANs 


Wireless local area network (LAN) standards have been developed over the past few years 
by working groups within the IEEE (Institute of Electrical and Electronics Engineers) and 
other international standards organizations. These standards make it possible to provide 
high-speed wireless access to the internet. We will focus on the IEEE 802.11a standard, 
which is based on OFDM. A typical configuration is illustrated in Figure 11.10, where 
access point (AP) terminals communicate with several users (laptop computers). 

The United States’s Federal Communications Commission (FCC) has allocated a 
300 MHz spectrum in the 5.2 GHz frequency band to provide wireless LAN services 
based on the 802.1 la standard. A nominal channel bandwidth of 20 MHz is subdivided 
into 52 subchannels with subcamier frequency spacing of 312.5 kHz. The OFDM symbol 
duration is 4 usec, and the length of the cyclic prefix (guard interval) is 0.8 usec. By 
using 48 subchannels to carry data via either BPSK, QPSK, 16-QAM, or 64-QAM, we can 
achieve (uncoded) data rates of 12-72 Mbps. With channel coding that is usually used to 
correct transmission errors, as discussed in Chapter 13, the achievable data rates in 802.11la 
are 6,9, 12, 18, 24, 36, 48, and 54 Mbps. In the remaining four subchannels, pilot tones are 
transmitted to measure and correct frequency offsets that may occur in the received signal 
due to terminal mobility. 

The multiple access scheme employed in 802.1 la is called carrier sense multiple 
access with collision avoidance (CSMA/CA). In simple terms, a terminal, before starting 
transmission, senses to see if the channel is available for signal transmission. If no other 
signal is detected to be above a set threshold, a packet of the signal is sent. After success- 
ful reception, the recipient sends back an acknowledgement. After receiving the acknow- 
ledgement, the user waits for a certain randomly selected time interval before sending 
another packet. This standard operates in the 5.2-GHz frequency band; the IEEE 802.11 g 
standard, which is related, operates in the 2.4-GHz frequency band, where OFDM with 
QAM modulation is employed. 
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Figure 11.10 A wireless LAN configuration. 
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11.6.3 Digital Audio Broadcasting 


Digital audio broadcasting (DAB) systems may be terrestrial, satellite based, or a combina- 
tion of the two. Eventually, these systems will replace the analog AM and FM broadcasting 
systems currently in use. Various standards organizations have established standards for 
DAB in many countries. We shall focus on the European Eureka-147 standard established 
by the ITU (International Telecommunications Union) and ETSI (European Telecommu- 
nication Standard Institute). 

The Eureka-147 DAB is a standard that operates in four different modes, where each 
mode is tailored to a specific frequency band and corresponding application. OFDM is used 
in all modes. The modulation used on each subchannel is differential quadrature phase-shift 
keying (QPSK). For example, mode 1 is employed in terrestrial broadcasting in the very 
high frequency band. In this mode, there are 1536 subcarriers with a subchannel spacing 
of 1 kHz. The symbol duration on each subcarrier is 1 msec and the frame duration is 
96 msec. The duration of the cyclic prefix is 246 usec. Another mode is designed for satel- 
lite transmission at frequency bands up to 3 GHz. For this mode, there are 192 subchannels, 
with an adjacent subchannel separation of 8 kHz. The symbol duration is 125 usec and the 
frame duration is 24 msec. The cyclic prefix duration is 31 usec. 

To conserve bandwidth, MPEG audio compression is employed in Eureka-147DAB. 
The audio quality achieved is comparable to CD quality. 

For more information on digital audio broadcasting, refer to the book by Hoeg and 
Lauterback (2001) and the paper by Layer (2001). 


11.7 SUMMARY AND FURTHER READING 


The focus of this chapter is on digital signal transmission via multicarrier modulation 
and orthogonal frequency-division multiplexing (OFDM). In multicarrier modulation, the 
available channel bandwidth W is subdivided into a number of equal-bandwidth subchan- 
nels, where the bandwidth of each subchannel is sufficiently narrow, so that within the 
frequency band of each subchannel, the frequency response is constant in both magnitude 
and phase. Thus, we create K subchannels in which K information symbols are trans- 
mitted simultaneously by modulating the subcarrier frequencies corresponding to the K 
subchannels. By selecting the symbol rate 1/T on each of the subchannels to be equal to 
the frequency separation between adjacent subcarrier frequencies, the signals transmitted 
in the K channels are orthogonal over the symbol interval T. Thus, we constructed an 
| OFDM signal. 

As a consequence of the orthogonality property of the signals on the K subchan- 
nels, the discrete-time signal that is formed at the transmitter is simply the inverse discrete 
Fourier transform (IDFT) of the K modulated subcarriers. Therefore, the computation of 
the IDFT at the transmitter is efficiently implemented by using the (inverse) fast Fourier 
transform (FFT) algorithm. The discrete-time modulated signal samples from the (inverse) 
FFT are passed to the D/A converter to produce a continuous-time signal that is trans- 
mitted on the channel. On the receiver side, the received signal is passed through an A/D 
converter and the samples from the converter are fed to the demodulator. Since the signal 
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samples are the IDFT of the modulated subcarriers, the demodulator computes the DFT of 
these samples to recover the information symbols on the Ķ subcarriers. Again, the DFT is 
computed efficiently by employing the FFT algorithm. To compensate for the effect of the 
channel on the information symbols carried in each subchannel, it is necessary to measure 
the magnitude and phase shift of the channel at each subcarrier. Such a measurement is 
obtained by transmitting pilot signals imbedded in the OFDM signal. For example, a small 
number of the OFDM subcarriers may be allocated as pilot subcarriers. 

We also described the use of a cyclic prefix in the FFT implementation of the modu- 
lator and demodulator. The cyclic prefix serves the purpose of a guard interval for elimi- 
nating ISI between successive OFDM blocks. The cyclic prefix samples inserted at the 
transmitter are discarded at the input to the FFT demodulator. Thus, any channel-induced 
time dispersion on the transmitted signal is eliminated. 

A view of the spectral characteristics of the OFDM signal shows that there is signifi- 
cant frequency-domain overlap among the subcarrier signals. Nevertheless, the subcarrier 
signals are orthogonal in the time domain over their time duration T. 

We emphasized that a disadvantage in the transmission of OFDM signals is their high 
peak-to-average power ratio (PAR), which may cause saturation of the transmitter power 
amplifier and result in signal distortion, usually called intermodulation distortion. Several 
methods were described to limit the value of the PAR in the OFDM signal. 

In the final section of the chapter, we described three applications of signal transmis- 
sion employing OFDM. The applications cited are (a) digital subscriber lines that are used 
to provide intemet services to subscribers using the telephone network, (b) wireless LANs, 
such as WiFi that provide internet access to mobile users, and (c) digital audio broadcasting 
systems that replace the current analog AM and FM broadcasting systems. 

There is a large amount of literature on multicarrier digital communication systems. 
One of the earliest systems, described by Doeltz et al. (1957), is called Kineplex; it was 
used for digital transmission in the high-frequency (HF) radio band. Other early work 
on multicarrier system design is described in the papers by Chang (1966) and Saltzberg 
(1967). The use of DFT for the modulation and demodulation of multicarrier OFDM 
systems was proposed by Weinstein and Ebert (1971). More recent references on appli- 
cations of OFDM in practical systems are papers by Chow et al. (1995) and Bingham 
(1990). The book by Bahai and Saltzberg (1999) provides a comprehensive treatment 
of OFDM. 

The problem of PAR reduction in multicarrier systems has been investigated by many 
people. The reader is referred to the papers by Boyd (1986), Popovic (1991), Jones et al. 
(1994), Wilkinson and Jones (1995), Wulich (1996), Tellado and Cioffi (1998), and Tarokh 
and Jafarkhani (2000). 


PROBLEMS 


11.1 Show that the sequence {xn, 0 < n < N—1} given by Equation (11.3.5) corresponds 
to the samples of x(t) given by Equation (11.3.6). Also, prove that x(t) given by 
Equation (11.3.6) is a real-valued signal. 
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11.2 


11.3 


11.4 
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Show that the IDFT of a sequence {X}, 0 < k < N — 1} can be computed by 
passing the sequence {X,} through a parallel bank of N linear discrete-time filters 
with system functions 


1 
Hn (2) = 7 n=0,1,...,N—-1, 


— ej2nn/Nz-1’ 
where the filter outputs are sampled atn = N. 


Assess the cost of the cyclic prefix (used in multicarrier modulation to avoid ISI) in 
terms of 


L Extra channel bandwidth. 

2. Extra signal energy. 
Let x[n] be a finite-duration signal with length N and let X[k] be its N-point DFT. 
Suppose we pad x[n] with L zeros and compute the (N + L)-point DFT, X’[k]. What 


is the relationship between X [0] and X’[0]? If we plot |X [k]| and |X’[k]| on the same 
graph, explain the relationship between the two graphs 


COMPUTER PROBLEMS 


11.1 


11.3 


Subcarrier Demodulation 


Consider the noise-free received signal r(t) in Example 11.2.2. Perform the demo- 
dulation steps given in Equations (11.2.4) through (11.2.7) and, thus, demonstrate 
that the demodulated symbol is X6 = 3 + j1. 


Generation of an OFDM signal for 16-point QAM 


Using the 16-point QAM signal constellation shown in Figure 8.54(a), select pseu- 
dorandomly each of the information symbols Xo, X1, X2,..., X9. With T =100 sec, 
generate the transmitted signal waveform x(t) given by Equation (11.3.6) for 
t = 0,1,...,100 and plot it. Then compute the IDFT values x, for n = 0, 
1,..., N—1, by using Equation (11.3.5). Demonstrate that x(t), evaluated at Tn/N, 
n=0,1,..., N—1, corresponds to the IDFT values. Finally, using the IDFT values 
{xn,0 <n < N — 1}, compute the DFT, defined as 


N-1 
1 on it 
xX, = — J me IN k=0, 1,...,N—1; 
VN a 


thus, demonstrate that the information symbols {X;, 1 < k < 9} are recovered from 
the samples of x(t), where t = nT/N,O<n<N-1., 
Generation of an OFDM Signal for Four-phase PSK 


Repeat Computer Problem 11.2 by generating an OFDM signal that transmits 
the K = 16 information symbols (Xo, X1,..., X15) selected pseudorandomly from 
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11.6 


11.10 


11.11 


the four-phase PSK constellation. Use the FFT algorithm to compute the DFT 
and IDFT. 

Computation of the IDFT of OFDM Signal 

Compute the IDFT of the OFDM signal given in Equation (11.3.5), where K = 16 
and the symbols (Xo, Xj, ..., X15) selected from the four-phase PSK constellation. 
Plot the magnitude |X (f)| for the OFDM signal. 

Use of Cyclic Prefix 

Using the OFDM signal generated in Computer Problem 11.2, add a cyclic prefix of 
four samples to account for the channel dispersion and, thus, modify the MATLAB 
script given in Computer Problem 11.2. 

Spectrum of OFDM Signal 


Determine the magnitude |U;,(f)| of the Fourier transform of the signal given by 
Equation (11.4.2) for fe = k/T, k = 0,1, 2, 3, 4, 5. For simplicity, let A, = 1 
and 6, = 0 forall k. Plot |U,(f)| on the same graph for k = 0, 1, 2, 3, 4,5 for 
0<f <4/T. 

Computation of PAR for QAM OFDM Signal 

Repeat the PAR computation in Example 11.5.1 for the M = 16 QAM OFDM 
signal generated in Computer Problem 11.2. Compare the computed PAR for QAM 
OFDM signal with the PSK OFDM signal. 

Computation of PAR for PSK OFDM Signal 

Repeat the PAR computation in Example 11.5.1 for K = 128 andcompare the result 
with K = 32. 

Limiting the PAR by Clipping the Peaks 


Repeat Example 11.5.1, but in this case clip the peak amplitude of the samples so 
that the PAR < 3 dB. Define the clipped signal as {*,} and compute the signal 


distortion defined as 
199 


= oan 
P= 399 Dn — $n) . 


Plot D for the 20 signal realizations. 
Repeat Computer Problem 11.9 for the PAR < 4 dB and for the PAR < 6 dB. 


Reduction of PAR by Random Phase Shifting 
Generate samples of the OFDM signal 


K-1 


2ukt 
x(0) = 008 (7 +0, +h), 0<t<T, 


k=1 
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where the phase ¢x is selected from a uniform distribution on interval (0, 21) and the 
remaining signal parameters are identical to those in Exaple 11.5.1. Generate four 
sets of uniformly distributed phases {@,}, and for each realization of x(t), select the 
set that yields the smallest PAR. Repeat this process for 20 different realizations of 
x(t) and plot the resultant PAR for the 20 realizations of x(t). 


11.12 Repeat Computer Problem 11.11 for K 


= 8, four-phase PSK transmitted 
symbols. 
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In Chapter 1, we saw that the essential parts of any communication system are the informa- 
tion source, the communication channel, and the destination. We also saw that these three 
components are usually the given parts of a communication system, i.e., the communica- 
tion engineer usually does not have much control over them. We also saw that two systems, 
i.e., the transmitter and the receiver, connect the information source to the channel and the 
channel to the destination, respectively. These two systems are completely designed by the 
communication engineer; therefore, they are under the full control of the designer. The role 
of these systems is to match the output of their preceding systems (the information source 
in the case of the transmitter, and the channel in the case of the receiver) to the system that 
comes after them (the channel in the case of the transmitter, and the destination in the case 
of the receiver). Another goal in the design of the transmitter and the receiver is to make 
sure that the signal will be resistant to channel impairments during the transmission. This 
results in different design techniques and methodologies, depending on the properties of 
the channel. For instance, in designing signals for transmission over a bandlimited channel, 
the problems of distortion and intersymbol interference must be considered in the design 
of the transmitted signal. When many transmitters and receivers share the same commu- 
nication medium, as in the case of wireless communication, the problem of interference 
among different users should be addressed as well. 

In Chapters 3, 4, 6, 7, 8, 9, 10, and 11, we studied different communication situations 
and signal design techniques. In Chapters 3, 4, and 6, we studied analog communication 
systems and the signal-to-noise ratio (SNR) that each system can provide at the output. In 
Chapters 7, 8, 9, 10, and 11, we showed how analog signals can be transformed to digital 
signals and how digital signals can be transmitted. None of these chapters studied the 
fundamental limits on communications. Even in Chapters 8, 9, 10, and 11, where optimal 
receivers for a given signal set were designed, we did not design the optimal signal set; 
thus, there was no discussion of overall communication system optimality. 

In this chapter, we study communication systems from another point of view. We 
study fundamental limits on the representation and transmission of information. In other 
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words, we try to determine the limits of communication. We will answer questions such as 
the following: What is the highest rate at which information can be reliably transmitted over 
a communication channel? What is the lowest rate at which information can be compressed 
and still be retrievable with small or no error? What is the complexity of such optimal 
systems? These questions belong to a branch of communication theory called information 
theory, a field founded in 1948 by the pioneering work of Claude E. Shannon. 

Shannon’s fundamental contributions can be classified into two main categories. The 
first category consists of fundamental limits that apply to information sources. In this cate- 
gory, mathematical models for information sources are developed, and a method for mea- 
suring the information content of a source is introduced. Then the fundamental question 
of source coding is addressed and answered, i.e., what is the minimum rate at which a 
source can be compressed and still be recoverable from the compressed version? This is 
the essence of Shannon’s source-coding theorem. 

The second category of fundamental limits concems the transmission of information 
over noisy channels. The fundamental question is: What is the maximum rate at which 
information can be transmitted reliably over a noisy channel? This question is addressed 
in the well-known noisy channel coding theorem. 


12.1 MODELING INFORMATION SOURCES 


Communication systems are designed to transmit information. In any communication sys- 
tem, an information source produces the information. The purpose of the communication 
system is to transmit the output of the source to the destination. In radio broadcasting, for 
instance, the information source is either a speech source or a music source. In TV broad- 
casting, the information source is a video source, and the output is a moving image. In fax 
transmission, the information source produces a still image. In communication between 
computers, either binary data or ASCII characters (encoded as a sequence of binary digits) 
are transmitted; therefore, the source can be modeled as a binary or ASCII source. In the 
storage of binary data on a computer disk, the source is again a binary source. 

To study various information sources, we require a mathematical model to represent 
information sources and to measure their information content. Hartley, Nyquist, and Shan- 
non were the pioneers in defining quantitative measures for information. In this section, 
we investigate the mathematical modeling of information sources and define a measure of 
information. In the next few sections, we will see how the output of an information source 
can be made more compact for easier transmission or storage. The process of representing 
the output of an information source with a small number of bits is called data compression. 

The intuitive and common notion of information refers to any new knowledge about 
something. We can obtain information via hearing, seeing, or other means of perception. 
The information source, therefore, produces outputs that may interest the receiver of the 
information (who does not know these outputs in advance). The role of the communication 
system designer is to make sure that this information is transmitted to the receiver correctly. 
Since the output of the information source is a time varying unpredictable function (if it 
is predictable, there is no need to transmit it), it can be modeled as a random process. In 
previous chapters, we have already seen that the existence of noise in communication chan- 
nels causes stochastic dependence between the input and output of the channel. Therefore, 
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the communication system designer creates a system that transmits the output of a random 
process (information source) to a destination via a random medium (channel) and ensures 
low distortion. 

Information sources can be modeled by random processes, and the properties of the 
random process depend on the nature of the information source. Some basic characteristics 
of an information source include bandwidth, range of amplitude variations, power content, 
and statistical properties (e.g., probability density function (PDF) of the amplitude, station- 
arity, and power spectral density). For example, when we model speech signals, our result 
is a random process that has all its power in a frequency band of approximately 300-3400 
Hz. Therefore, the power spectral density of the speech signal also occupies this band of 
frequencies. A typical power spectral density for the speech signal is shown in Figure 12.1. 
Video signals are restored from a still or moving image; therefore, the bandwidth depends 
on the required resolution. For TV transmission, this band is typically 0-6.5 MHz. 

There is one common element in each of these processes: They are bandlimited pro- 
cesses and can be sampled at the Nyquist rate or faster and can be reconstructed from the 
sampled values. Therefore, it makes sense to limit this chapter to discrete-time random 
processes because all information sources of interest can be modeled by such a process, 
after they are sampled. The mathematical model for an information source is shown in 
Figure 12.2. Here, the source is modeled by a discrete-time random process {X;}?2_.,- 
The random variables X; are defined over an alphabet that can be either discrete (e.g., 
for binary data) or continuous (e.g., for sampled speech). The statistical properties of the 
discrete-time random process depend on the nature of the information source. 

Here, we will only study simple models for information sources. The study of more 
complicated models is mathematically demanding and beyond the scope of this book. How- 

_ ever, even simple models enable us to precisely define a measure of information and bounds 
on the compression and transmission of information. 

The simplest model for the information source that we study is the discrete memory- 
less source (DMS). A DMS is a discrete-time, discrete-amplitude random process in which 
all X;’s are generated independently and with the same diswibution. Therefore, a DMS gen- 
erates a sequence of i.i.d. (independent and identically distributed) random variables that 
take values in a discrete set. 


Sx(f) j 


—3400 —300 | 300 3400 f Figure 12.1 Typical power spectrum of a speech 
signal. 
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a ee discrete-time information source. 
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Let 4 = {a1, a2,...,an} denote the set in which the random variable X takes its 
values, and let the probability mass function for the discrete random variable X be denoted 
by pj = P(X =a;) fori = 1,2,..., N. A full dscripucn of the DMS is given by the st, 
A, called the alphabet, and the probabilities {pi W a 


Example 12.1.1 3 
An information source is described by the alphabet A = {0,1} and p(X; = 1l) = 


1 — P(X; = 0) = p. This is an example of a discrete memoryless source, and it gener- 
ates a sequence of zeros and ones; therefore, it is a binary source. In the special case where 
p = 0.5, the source is called a binary symmetric source, or BSS, for short. Py 


12.1.1 Measure of Information 


To provide a quantitative measure of information, we start with the basic model of an 
information source and define its information content to satisfy certain intuitive properties. 
Assume that the source that we are considering is a discrete and memoryless source. Let 
the outputs of this source be revealed to an interested party. Let a be the most likely output 
and ay be the least likely output. For example, we could imagine the source to represent 
both the weather conditions and air pollution in a certain city (in the northern hemisphere) 
during July. In this case, # represents various combinations of different weather conditions 
and pollution (such as hot and polluted, hot and lightly polluted, cold and highly polluted, 
cold and mildly polluted, and very cold and lightly polluted). The question is, Which out- 
put conveys more information, a; or ay (the most probable or the least probable output)? 
Intuitively, revealing ay (very cold and lightly polluted, in the previous example) reveals 
the most information. It follows that a rational measure of information for an output of an 
information source should be a decreasing function of the probability of that output. A sec- 
ond intuitive property of a measure of information is that a small change in the probability 
of a certain output should not drastically change the information delivered by that output. 
In other words, the information measure should be a continuous function of the probability 
of the source output. 

Now assume that the information about output a; can be subdivided into two inde- 
pendent parts called aj; and ajz, i.e., Xj = (Xj1, Xj2), aj = {aj1, aj2} and P(X = aj) = 
P(X j, = aj) P(X j2 = aj2). This can happen if we assume that the temperature and pol- 
lution were almost independent; therefore, each source output can be subdivided into two 
independent components. Since the components are independent, revealing the informa- 
tion about one component (temperature) does not provide any information about the other 
component (pollution); so, intuitively, the amount of information provided by-revealing a j 
is the sum of the two information components obtained by revealing a jı and a j2. From the 
preceding discussion, we can conclude that the amount of information revealed about an 
output a; with probability p; must satisfy the following conditions: 


1. The information content of output a; depends only on the probability of a; and not 
on the value of aj. We denote this function by /(p;), and we call it self information. 


2. Self information is a continuous function of pj, i.e., I (-) is a continuous function. 
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3. Self information is a decreasing function of its argument. 
4. If pj = PjıPj2, then I(p;) be T(pj) + I(pj2). 


It can be shown that the only function that satisfies all these properties is the logarithmic 
function, i.e., I(x) = — log(x). The base of the logarithm is not important and simply 
determines the unit by which the information is measured. If the base is 2, the information 
is expressed in bits per source symbol or bits per sample. Note that we can use the relation 
log, x = Tu to find logarithms! in base 2. 

Now that the information revealed about each source output a; is defined as the self 
information of that output, which is given by —log(p;), we can define the information 
content of the source as the weighted average of the self information of all source outputs. -- 
This is justified by the fact that the frequency of appearance of various source outputs is 
proportional to their corresponding probabilities. Therefore, the information revealed by 
an unidentified source output is the weighted average of the self information of the various 
source outputs, i.e., Bilt piiI (pi) = ae —pi log p;. The information content of the 
information source is known as the entropy of the source and is denoted by H (X). 


Definition 12.1.1. The entropy of a discrete random variable X is a function of its 
PMF and is defined by oe i 


N = N f 
1 
H(X) =— y pi log pi = X pi log (+) , g (12.1.1) 
i=1 o i=l : F 
where 0log 0 = 0. 7 | j i a 


Note that there is a slight abuse of notation here. We would expect H(X) to denote a 
function of the random variable X; hence, it would be a random variable itself. However, 

- H(X) is a function of the probability mass function (PMF) of the random variable X and 
is just a number. - 


Example 12.1.2 
For a binary memoryless source with probabilities p and 1 — p, we have 


H(X) = —plog p — (1 — p) log(1 — p). (12.1.2) 


This function, denoted by H,(p), is known as the binary entropy function, and a plot of it is 
given in Figure 12.3. a 


From the plot of the binary entropy function, we can see that the entropy of the binary 
memoryless source is zero when either p = 0 or p = 1. These two cases correspond to the 
time when the source generates all zeros or all ones. In both cases, the source is determinis- 
tic and completely predictable. On the other hand, entropy is maximized, with a maximum 
equal to 1, when the two source outputs are equiprobable and each has probability 5 This 
is the case where the output is least predictable; therefore, its entropy is maximized. This 
supports the intuitive development of entropy as a measure of the information content of a 
source (or measure of uncertainty of it). 


‘In this chapter, we use log x denote log, x. 
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0 Figure 12.3 The binary entropy 
0 01 02 03 04 05 06 07 08 09 1 P function. 


Example 12.1.3 


A source withthe bandwidth 4000 Hz is sampled at the Nyquist rate. Assuming that the result- 


ing sequence can be approximately modeled by a DMS with alphabet & = {—2, —1, 0, 1, 2} 


and with corresponding probabilities {3 a ee h determine the rate of the source in bits 


per second. 


Solution Wehave 


1 1 1 1 15 
A(X) = 3 l0s2 + q os4 + 3 log8 +2 x 7g 0816 =e bits/sample. 


Since we have 2 x 4000 = 8000 samples/sec, the source produces information at a rate of 
8000 x #2 = 15,000 bits/sec. m 


Example 12.1.4 


A discrete memoryless source has an alphabet of size N and the source outputs are equiprob- 
able (each having a probability of + ). Find the entropy of this source. 


Solution We have 


=logN. 0 (12.13) 
l a 


The entropy of a discrete random variable X with N possible values satisfies the 
lower and upper bounds given in Equation (12.1.4). 


0 < H(X) < logN. (12.1.4) 


The lower bound follows from the fact that because 0 < p; < 1, we have — log p; > 0. 
The lower bound H(X) = 0 is achieved when the random variable X is deterministic, 
i.e., it takes one of its possible values with probability 1, and the rest with probability 0. 
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The uppper bound H (X) = log N is achieved when the random variable is uniformly 
distributed (see Problem 12.7). 


12.1.2 Joint and Conditional Entropy 


“When dealing with two or more random variables, we can introduce joint and conditional 
entropies in exactly the same way that joint and conditional probabilities are introduced. 
These concepts are especially important when dealing with sources with memory. 

N 


Definition 12.1.2. The joint entropy of two discrete random variables (X, Y) is 
defined by 


H(X, Y) =—)~ p(x, y) log p(x, y). (12.1.5) 
x,y 
For the case of n random variables X = (X1, X2,..., Xn), we have 
H(X) =- >  p@X2 .-., Xn) log pP, X2 -+ -, Xn). (12.1.6) 
Pa Ee S n y 
E 


í 


As seen, the joint entropy is simply the entropy of a vector-valued random variable. 

The conditional entropy of the random variable X, given the random variable Y, 
can be defined by noting that if Y = y, then the PMF of the random variable X will be 
p(x|y) and the corresponding entropy is H(X|Y = y) = — }_, p(x|y) log p(x|y), which 
is intuitively the amount of uncertainty in-X-when we know that Y = y. The weighted 
average of these quantities over all y isthe uncertainty in X when Y is known. This“ quantity 
is known as the conditional nye and defined next. 


/ 
Definition 12.1.3. The conditional entropy of the random variable X, given the 
random variable Y, is defined by 


H(X|Y) = - J p(x, y) log p(y). (12.1.7) 
x,y 
In general, we have 
_ A(X,|X1,..., Xn-1) =— ys D(X, .--,%n) log pO |41, - - - , Xn—-1)- (12.1.8) 
eS Xis- Xn 
E 


Example 12.1.5 
Using the chain rule for PMFs, namely, A y) = p( ») P(xly), as given in Chapter 5, show 
that H(X, Y) = A(Y) + H(X|Y). 
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Solution From the definition of the joint entropy of two random variables, we have 


H(X, Y) = — J| p(x, y) log p(x, y) 


Xy `- 


=—) p(x, y) loglp 0) P@ly)] 


x,y 
=- p(x, y)log p(y) — $ p(x, y) log ply) 

x,y x,y 
= — > p(y) log p(y) — J pæ, y) log ply) 

PA . x,y 
= A(Y)+ A(X |Y), (12.1.9) 

where, in the last step, we have used 
>> p(x, y) = po). “(12.1.10 


This relation states that the information content of the pair (X, Y) is equal to the informa- 
tion content of Y plus the information content of X after Y is known. Equivalently, it states 
that the same information is transferred by either revealing the pair (X, Y) or by first reveal- 
ing Y and then revealing the remaining information in X. This relation can be generalized 
to the case of n random variables to show the chain rule for entropies, as follows: 


A(X) = A(X) + H(X2|X:) + --- + AK |X, X2; ..-, Xn-1). (12.1.11) 
In the case where the random variables (X1, X2,..., Xn) are independent, this relation 
reduces to 
H(X) =) H(X)). (12.1.12) 
i=l 


If the random variable X, denotes the output of a discrete (not necessarily mem- 
oryless) source at time n, then H(X |X,) denotes the new information provided by the 
source output X2 to someone who already knows the source output X,. Similarly, 
H(X,|X 1, X2,..., Xn_1) denotes the new information in X, for an observer. who has 
observed (X1, X2,..., Xn_1). The limit of this conditional entropy as n tends to infinity is 
known as the entropy rate of the random process. i 


Definition 12.1.4. The entropy rate of a stationary discrete-time random process is 
defined by 


H = lim H(Xn|X1, X2 ---» Xn-1). (12.1.13) 
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Stationarity ensures the existence of the limit. It can be proved that an altemative 
definition of the entropy rate for sources with memory is given by 


1 
H = lim —H(X1, X2,..., Xn). (12.1.14) 


n>œ n 


Entropy rate plays the role of entropy for sources with memory. It is basically a 
measure of the uncertainty per output symbol of the source. 


Example 12.1.6 
Two binary random variables X and Y are distributed according to the joint PMF given by 
P(X =0,Y =1)= L 
4 
EAE E e 
2 
P(X=0,¥ =O) = 7, 


Determine H(X, Y), H(X), H(Y), H(X|Y), and H(Y|X). 
Solution Wehave P(X = 1, Y = 0) = 0 and H(X, Y) = Lig : lig l Lio Le 
ear a IN oa OR ae aes 


3 
—. Next, we have 


PX =) = P(X SLY =O +PX=LY =) =5 > PX =0) = 5 


and 
Ba 
PU =I) =P =1Y=)+PR=0,7 =) =2 5 PY == 4. 
Thus, we conclude that x 
l l i lı 
H(X) = -3183 5 85 
and 
3 3 1 1 
H Y =--] --— ame: — = U. . 
(Y) 41°85 783 0.8113 
Now since 
A(X,Y) = H(X) + H(Y |X), 
Ee 
we have 
A(Y|X) =15-1=0:5 
and 


A(X|Y) = A(X, Y) — A(Y) = 1.5 — 0.8113 = 0.6887. bB 
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12.1.3 Mutual Information 


For discrete random variables, H (X |Y) denotes the entropy (or uncertainty) of the random 
variable X after the random variable Y is known. Therefore, if the entropy of the random 
variable X is H(X), then H(X) — H(X|Y) denotes the amount of uncertainty of X that 
has been removed by revealing random variable Y. In other words, H(X) — H(X|Y) is the 
amount of information provided by the random variable Y about random variable X. This 
quantity plays an important role in both source and channel coding and is called the mutual 
information between two random variables. 


Definition 12.1.5. The mutual information between two discrete random variables 
X and Y is denoted by I (X; Y) and is defined by 


I(X; Y) = H(X) — H(X|Y). | (12.1.15) 


Using the expression for entropy and Equation (12.1.7) for conditional entropy, we 
can determine the following relation for the mutual information between two random vari- 
ables X and Y: 





I(X;Y)= D A p(x, y) log ply) 


reZ yY p) 
S53 pe, y)log 22. (12.1.16) 
a PPO) 


Example 12.1.7 

Let X and Y be binary random variables with P(X = 0, Y =0) = is P(X =1,Y=0)= 
and P(X =0, Y =1) =}. Find /(X; Y) in this case. 

Solution We have, P(X = 0) = P(Y = 0) = 3; therefore, H(X) = H(Y) = H,(3) = 
0.919. On the other hand, the (X, Y) pair is a random vector uniformly distr®uted on three 
values: (0, 0), (1,0), and (0, 1). Therefore, H(X, Y). = log3 = 1.585. From this, we have 
A(X\|Y) = A(X, Y) — A(Y) = 1.585 — 0.919 = 0.666 and I (X; Y) = H(X) — A(X|Y) = 
0.919 — 0.666 = 0.253. E 


It can be shown (see Problem 12.31) that 
I(X; Y) = I (Y; X) = H(X)—H(X\Y) = H(Y)—H(Y|X) = H(X)+H(Y)— H(X, Y). 
= (12.1.17) 


Figure 12.4 represents the relation among entropy, conditional entropy, and mutual 
information quantities. 


12.1.4 Differential Entropy 
So far, we have defined entropy and mutual information for discrete sources. If we are 


dealing with a discrete-time continuous-alphabet source whose outputs are real numbers, 
no quantity exists that has the intuitive meaning of entropy. In the continuous case, another 
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H(Y) 


Figure 12.4 Entropy, 
conditional entropy, and 
mutual information quantities. 





quantity that resembles entropy, called differential entropy, is defined. However, it does 
not have the intuitive interpretation of entropy as the uncertainty in the source output. In 
fact, to reconstruct the output of a continuous source reliably, an infinite number of bits per 
source output are required because any output of the source is a real number and the binary 
expansion of a real number has infinitely many bits. 


N 
Definition 1231.6. The differential entropy of a continuous random variable X with 
the probability density function fx (x) is denoted by A(X) and defined by 


h(X) = -f fg(x) log fx(x)dx, (12.1.18) 


where 0log0 = 0. a 


Example 12.1.8 
Determine the differential entropy of arandom variable X uniformly distributed on [0, a]. 


Solution From the definition of differential entropy, 


1 1 
n(x) =— f -tog “dx = toga 
oa “a 


Clearly, fora < 1, we have A(X) < 0, which is in contrast to the nonnegativity of the entropy 
of discrete sources. Also, fora = 1, A(X) = 0 without X being deterministic. This is again in 
contrast to the entropy properties of discrete sources. a 


Example 12.1.9 
Determine the differential entropy of a zero-mean Gaussian random variable with variance o°. 


Solution The PDF is f(x) = 





2 ; 

ae e 2:2 , Therefore, using natural logarithms, we find 
mo 

the differential entropy to be 


h(X) = -fm (=) S (x)dx -F In (<*) f(x)dx 


202 





= sin (2xeo’) , (12.1.19) 
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where f°, f(x)dx = 1 and f°. x? f(x)dx = 07. Changing the base of the logarithms to 2, 
we have 


h(X) = 5 log,@neo?) bits. (12.1.20) 
a 


Extensions of the definition of differential entropy to joint random variables and 
conditional differential entropy are straightforward. For two random variables, we have 


æy = | i F(x, y)log f(x, y)dxdy . (12.1.21) 


and 
A(X |Y) = h(X, Y) — h(Y). (12.1.22) 


The mutual information between two continuous random variables X and Y is defined 
similarly to the discrete case as 


I(X; Y) =h(Y) —h(Y|X) = h(X) —h(X|Y). (12.1.23) 


Although differential entropy does not have the intuitive interpretation of discrete source 
entropy, the mutual information of continuous random variables has the same interpretation 
as discrete random variables, i.e., the information provided by one random variable about 
the other random variable. 





12.2 THE SOURCE CODING THEOREM 


hs 

The source coding theorem is one of the three fundamental theorems introduced by Shan- 
non (1948a, 1948b). The source coding theorem establishes a fundamental limit on the rate 
at which the output of an information source can be compressed without causing a large 
error probability. We have already seen that the entropy of an information source is a mea- 
sure of the uncertainty or, equivalently, the information content of the source. Therefore, 
it is natural that the entropy of the source plays a major role in the statement of the source 
coding theorem. 

The entropy of an information source has a very intuitive meaning. Assume that we 
are observing outputs of length n of a DMS where n is very large. Then, actording to the 
law of large numbers (see Chapter 5), there is a high probability (that goes to 1 as n — o0) 
that letter a; is repeated approximately np; times, letter a2 is repeated approximately np2 
times, ..., and letter ay is repeated approximately npy times. This means that when n 
is large enough, with a probability approaching 1, every sequence from the source has 
the same composition and therefore the same probability. The sequences x that have this 
structure are called typical sequences. Using the fact that the source is memoryless, the 
probability of a typical sequence is given by 
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N 
= I] ni log pi 


= 2” DP, Pi log pi 


= 27n H(X) 


This means that for large n, typical output sequences of length- of the source are equally 
probable with probability ~ 2~””, On the other hand, thé probability of the set of non- 
typical sequences is negligible. 

Since the probability of the typical sequences is almost 1 and each typical sequence 
has a probability of almost 2-”4, the total number of typical sequences is almost 2”7(*), 
Therefore, although a source of alphabet size N can produce N” sequences of length n, the 
“effective” number of outputs is 2”7, By “effective number of outputs,” we mean that 
almost nothing is lost by neglecting the other outputs and the probability of having lost 
anything goes to zero as n goes to infinity. Figure 12.5 gives a schematic diagram of this 
property. The notion of typicality and the properties of typical sequences are practically 
very important. It tells us that, for all practical purposes, it is enough to consider the set 
of typical sequences rather than the set of all possible outputs of the source. The error 
introduced in ignoring nontypical sequences can be made smaller than any given € > 0 by 
choosing an n that is large enough. This is the essence of data compression, the practice of 
representing the output of the source with a smaller number of sequences than the number 
of the outputs that the source really produces. 

From this result, and since € is an arbitrary positive number, we can only repre- 
sent the typical source outputs without introducing considerable error. Since the total 
number of typical sequences is roughly 2”%, we need nH (X) bits to represent them. 


Set of typical sequences with = 2"H (X) 
elements 


Gr 


Figure 12.5 Theset of typical and 
nontypical sequences. 
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However, these bits are used to represent source outputs of length n. Therefore, on the 
average, any source output requires H(X) bits for an essentially error-free representation. 
This once again justifies the notion of entropy as the amount of information per source 
output. 

So far, we have assumed that the source is discrete and memoryless; therefore, it 
can be represented by an i.i.d. random variable. Such a source can only be compressed 
if its PMF is not uniform. For X uniformly distributed, as shown in Example 12.1.4, we 
have H(X) = log N; therefore, 2"7(%) — 2710s — N”, This means that the “effective” 
number of source outputs of length n is equal to the total number of source outputs and no 
compression is possible. 

We have not considered the case where the source has memory. For a source with 
memory, the outputs of the source are not independent; therefore, previous outputs reveal 
some information about the future ones. This means that the rate at which fresh information 
is produced decreases as more and more source outputs are revealed. A classic example of 
such a case is printed English text, which shows a lot of dependency between letters and 
words (e.g., a “q” is almost always followed by a “u”, a single letter between two spaces is 
either an “T” or “a”). The entropy per letter for a large text of English is roughly the limit of 
H(X,,|X1, X2,..., Xn—1) as n becomes large (the entropy rate defined'in Section 12.1.2). 
In general for stationary sources, the entropy rate has the same significance as the entropy 
for the case of memoryless sources and defines the number of “effective” source outputs 
for any n that is large enough, i.e., 2”” where H is the entropy rate. j 

Studies with statistical models of printed English show that the entropy rate con- 
verges rather quickly; for n = 10, we are very close to the limit. These studies show that 
forn = 1, i.e., a memoryless source model, we have H(X) = 4.03 bits/letter. As the 
memory increases, the size of the space over which conditional probabilities are computed 
increases rapidly, and it is not easy to find the conditional probabilities required to com- 
pute the entropy rate. Some methods for estimating these conditional probabilities have 
been proposed in the literature and, based on these methods, the entropy of English is 
estimated to be around 1.3 bits/letter. (In these studies, only the 26 letters of the English 
alphabet and the space mark have been considered.) 

So far, we have given an informal description of the source coding theorem and 
justified it. A formal statement of the theorem, without proof, is given next. The interested 
reader is referred to the references at the end of this chapter for a proof. 

Source Coding Theorem A source with entropy (or entropy rate) H can be encoded 
with an arbitrarily small error probability at any rate R (bits/source output) as long as 
R > H. Conversely, if R < H, the error probability will be bounded away from zero, 
independent of the complexity of the encoder and the decoder employed. a 

This theorem, first'proved by Shannon (1948a), only gives necessary and sufficient 
conditions for the existence of source codes. It does not provide any algorithm for the 
design of codes that achieve the performance predicted by this theorem. In the next section, 
we present two algorithms for the compression of information sources. One is due to Huff- 
man (1952), and the second is due to Lempel and Ziv (1977, 1978). 
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12.3 SOURCE CODING ALGORITHMS 


In the preceding section, we observed that H, the entropy of a source, gives a sharp bound 
on the rate at which a source can be compressed for reliable reconstruction. This means 
that at rates above entropy, it is possible to design a code with an error probability as small 
as desired, whereas at rates below entropy, such a code does not exist. This important 
result, however, does not provide specific algorithms to design codes approaching this 
bound. In this section, we will introduce two algorithms to design codes that are close 
to the entropy bound. These coding methods are the Huffman coding algorithm and the 
Lempel-Ziv source coding algorithm. 


12.3.1 The Huffman Source Coding Algorithm 


In Huffman coding, fixed length blocks of the source output are mapped to variable length 
binary blocks. This is called fixed-to-variable length coding. The idea is to map the more 
frequently occurring fixed length sequences to shorter binary sequences and the less fre- 
quently occurring sequences to longer binary sequences, thus achieving good compres- 
sion ratios. In variable length coding, synchronization is a problem. This means that there 
should be only one way to parse the binary received sequence into code words. The next 
example clarifies this point. 


Example 12.3.1 
Assume that the possible outputs of an information source are {a,, a2, a3, a4, as} with the cor- 
responding probabilities {}, i, i, $, $ }. Consider the following four codes for this source: 





Letter Probability Code words 
i Code 1 Code2 Code3 Code 4 


a p= ; 1 1 0 00 
a p= ; 01. 10 10 01 
a3 p= ; 001 100 110 10 
as p= 5 0001 1000 1110 1 
a p= 5 00001 10000 1111 110 


In the first code, each code word ends with a 1. Therefore, as soon as the decoder 
observes a 1, it knows that the code word has ended and a new code word will start. This 
means that the code is a self-synchronizing code. In the second code, each code word starts 
with a 1. Therefore, upon observing a 1, the decoder knows that a new code word has started; 
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hence, the previous bit was the last bit of the previous code word. This code is again self- 
synchronizing, but it is not as desirable as the first code. The reason is that, in this code, we 
have to wait to receive the first bit of the next code word to recognize that a new code word 
has started. However, in Code 1, we recognize the last bit without having to receive the first bit 
of the next code word. Both Codes 1 and 2 are therefore uniquely decodable. However, only 
Code 1 is instantaneous. Codes 1 and 3 have the convenient property that no code word is the 
prefix of another code word. They satisfy the prefix condition. We can prove that a necessary 
and sufficient condition for a code to be both uniquely decodable and instantaneous is that 
it satisfy the prefix condition. This means that both Codes 1 and 3 are uniquely decodable 
and instantaneous. However, Code 3 has the advantage of having a smaller average code word 
length. In fact, for Code 1, the average code word length is 


1 1 1 1 1 
E[L]=1x3+2x7+3x g t4x g t5 gg 73116 


and for Code 3, the average code word length is 





1 1 1 1 1- 
E = =: — = š 
[L] S PO TT 30/16 


Code 4 has a major disadvantage. This code is not uniquely decodable. For example, the 
sequence 110110 can be decoded in two ways, as asas or as a4a2a3. Codes that are not uniquely 
decodable are not desirable and should be avoided in practice. From this discussion, we can see 
that the most desirable of the four codes is Code 3, which is uniquely decodable, instantaneous, 
and has the least average code word length. This code is an example of a Huffman code, which 
we will discuss, shortly. E 


As already mentioned, the idea in Huffman coding is to choose code word lengths 
such that more probable sequences have shorter code words. If we can map each source 
output of probability p; to a code word of length approximately log ra and, at he same 
time, ensure unique decodability, then we can achieve an average code word length of 
approximately }°, p; log S which is equal to H(X). Huffman codes are uniquely decod- 
able instantaneous codes with a minimum average code word length. In this sense, they are 
optimal. By optimal, we mean that among all codes that satisfy the prefix condition (and 
hence are uniquely decodable and instantaneous), Huffman codes have the minimum aver- 
age code word length. Next, we present the algorithm for the design of Huffman codes. 
From the algorithm, it is obvious that the resulting code satisfies the prefix condition. 
The proof of the optimality is omitted; for more information, refer to the references in 
Section 12.7. 


. Huffman Coding Algorithm. 


1. Sort source outputs in decreasing order of their probabilities. 


2. Merge the two least probable outputs into a single output whose probability is the 
sum of the corresponding probabilities. 


3. If the number of remaining outputs is 2, then go to Step 4; otherwise go to Step 2. 


4. Arbitrarily assign 0 and 1 as code words for the two remaining outputs. 
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Sort in decreasing 
order of probability 


| 


Merge the two | ; 
- least probable | 















Number of elements = 2? 


Assign 0 and 1 to 
the two code words | . 

















Is any element the 
result of the merger 
of two elements? 


Append the code word 
with 0 and 1 









Figure 12.6 Huffman toding 
se i algorithm. x 


5. If an output is the result of the merger of two outputs in a preceding step, append the 
current code word with a 0 and a 1 to obtain the code word for the preceding outputs; 
then, repeat Step 5. If no output is preceded by the output in current step, then stop. 


Figure 12\6 shows a flow chart of this algorithm. 


Example 12.3.2 
Design a Huffman code for the source given in the preceding Example 12.3.1: 


Solution The following tree diagram summarizes the design steps for code construction and 
the resulting code words: 
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0 Ł 0 
1 4 
uo 4 
110 $ 
1111 7 


The average code word length of a Huffman code is defined by 
R=E[L]= > pœ), (12.3.1) 
xed 


where I(x) is the length of the code word corresponding to the source output x. It can be 
shown that the average code word length satisfies the following inequality: 


H(X) < R < H(X) +1. (12.3.2) 
From this relation, it is obvious that the efficiency of a Huffman code, defined as 
_ H(X) 
a 


is always less than or equal to 1. 
If the Huffman code is designed for sequences of source letters of length n (the nth 
extension of the sourée), we would have 


H(X") < Re < A(X") +1, 
where R, denotes the average code word length for the extended source sequence; there- 


fore, R = +R. If our source is memoryless, we also have H(X”) = nH (X). Substituting 
these into the preceding equation and dividing by n, we have 


H(X) < R < H(X) + 1 (12.3.3) 
z l 





Therefore, R canbe made as close to H(X) as desired by selecting n large enough. It is 
also obvious that, for discrete sources with memory, R approaches the entropy rate of the 
source as defined in Equation (12.1.14). 


Example 12.3.3 


A discrete memoryless source with equiprobable outputs and alphabet M = {a1, az, a3} has 
the following Huffman code: 
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TG eee 

o 4 0 
1 0 

10 4 2 
; 1 

u 3 i 


The entropy of the source is H(X) = log3 = 1.585 and R = 3 = 1.667. If we use 
sequences of two letters, we will have the source 


A = {(ay, a1), (a1, a2), (a1, 43), - (a3, m), (a3, a3)} 
lliaiiliid 


with the probability vector p? = {1, 4,4, 4,1,1, 1,4,4 |. A Huffman code for this source 
is designed in the following tree diagram: 





000 a 
o0 4 
0011 1 
D 
O11 5 
011 1 
100 3 
110 1 
111 3— s 


Here, the average code word length is R} = 3.222 bits per pair of source outputs or 
1.611 bits per each source output. Comparing this with the previous example, we see that the 
average length is closer to the entropy of the source. In other words, working with the second 
extension of the source (i.e., two letters at a time) has improved the efficiency of the coding. If 
we use the third extension of the source, the efficiency will improve further. It is also obvious 
that, although the efficiency of the Huffman code improves by increasing n, the complexity of 
the code design and the encoding/decoding process increases as well. m 


12.3.2 The Lempel-Ziv Source Coding Algorithm 


We have already seen that Huffman codes are optimal in the sense that, for a given source, 
they provide a prefix code with minimum average block length. Nevertheless, there are 
two major problems in implementing Huffman codes. One problem is that the design of a 
Huffman code depend strongly on the source probabilities (statistics). The source statistics 
have to be known in advance to design a Huffman code. If we can only observe the source 
outputs, then we have to do the coding in two passes. In the first pass, we estimate the 
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statistics of the source (which, in the case of sources with memory and in cases where we 
want to apply Huffman coding to the extension of the source, becomes quite time consum- 
ing); in the second pass, coding is done. The other problem with Huffman codes is that if 
the code is designed for source blocks of length 1, it only employs variations in the fre- 
quency of the source outputs and not the source memory. If we want to employ the source 
memory as well, we have to design the code for blocks of lengtli 2 or more; this exponen- 
tially increases the complexity of the algorithm. For instance, encoding ASCII characters 
with a block length of 1 requires a tree with 256 terminal nodes, but if a block length of 2 is 
desired, the size of the tree and the complexity of coding becomes much higher. In certain 
applications, such as storage in magnetic or optical media, where high transfer rates are 
. desirable, the complexity and speed of Huffman coding becomes a bottleneck. 

The Lempel—Ziv algorithm belongs to the class of universal source coding algo- 
rithms, i.e., algorithms that are independent of the source statistics. This algorithm is a 
variable-to-fixed length coding scheme. This means that any sequence of source outputs is 
uniquely parsed into phrases of varying length and these phrases are encoded using code 
words of equal length. Parsing is done by identifying phrases of the smallest length that 
have not appeared before. To this end, the parser observés the source output. As long as the 
new source output sequence after the last phrase coincides with one of the existing phrases, 
no new phrase is introduced and another letter from the source is considered. As soon as 
the new output sequence is different from the previous phrases, it is recognized as a new 
phrase and encoded. The encoding scheme is simple. The new phrase is the concatenation 
of a previous phrase and a new source output. To encode it, the binary expansion of the lex- 
icographic ordering of the previous phrase and the new bit are concatenated. For example, 
assume that we want to parse and encode the following sequence: 


0100001 1000010100000101000001 100000101000010. 
Parsing the sequence by the rules previously explained results in the following phrases: 
0, 1, 00, 001, 10, 000, 101, 0000, 01, 010, 00001, 100, 0001, 0100, 0010, 


Clearly, all the phrases are different and each phrase is one of the previous pkrases con- 
catenated with a new source output. The number of phrases is 15. This means that, for each 
phrase, we need 4 bits, plus an extra bit to represent the new source output. The preceding 
sequence is encoded by 


0000 0, 0000 1, 0001 0, 0011 1, 0010 0, 0011 0, 0101 1, 0110 0, 0001 
1, 1001 0, 1000 1, 0101 0, 0110 1, 1010 0, 0100 0 


Table 12.1 summarizes this procedure. 

This representation can hardly be called a data compression scheme because a 
sequence of length 44 has been mapped into a sequence of length 75. However, as the 
length of the original sequence is increased, the compression role of this algorithm becomes 
more apparent. We can prove that for a stationary and ergodic source, as the length of the 
sequence increases, the number of bits in the compressed sequence approaches nH (X). 
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TABLE 12.1 SUMMARY OF LEMPEL-ZIV EXAMPLE 





Dictionary Dictionary Code Word 











Location Contents 

1 0001 0 0000 0 
2 0010 1 0000 1 
3 0011 00 0001 0 
4 0100 001 0011 1 
: 5 0101 10 0010 0 
6 0110 000 0011 0 
F: 0111 101 0101 1 
8 1000 0000 0110 0 
9 1001 01 0001 1 
10 1010 010 1001 0 
11 1011 00001 1000 1 
12 1100 100 0101 0 
13 1101 0001 0110 1 
14 1110 0100 1010 0 
15 1111 0010 0100 0 


where H (X) is the entropy rate of the source. The decompression of the encoded sequence 
is straightforward and can be done very easily. 

One problem with the Lempel—Ziv algorithm is how the number of phrases should be 
chosen. Here we have chosen 15 phrases, which leaves us 4 bits to represent each phrase. 
In general, any fixed number of phrases will eventually become too small and overflow 
would occur. For example, if we were to continue coding this source for additional input 
letters, we could not add the new phrases to our dictionary because we have assigned four 
bits for representation of the elements of the dictionary and we already have 15 phrases in 
the dictionary. To solve this problem, the encoder and decoder must purge their dictionaries 
of elements that are not useful anymore and substitute new elements for them. The purging 
method should, of course, be a method on which the encoder and the decoder have agreed. 

The Lempel—Ziv algorithm is widely used in practice to compress computer files. 
The “compress” and “uncompress” utilities under the UNIX® operating system and other 
compression programs (zip, gzip, etc.) are implementations of various versions of this 
algorithm. 
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12.4 MODELING OF COMMUNICATION CHANNELS 


The mathematical model for an information source together with a quantitative measure 
of information, as well as bounds on compression of an information source that are given 
by the source entropy, were presented in the preceding sections of this chapter. In this 
section, we study the other important component of a communication system, i.e., the 
communication channel. We also introduce the concept of coding for the protection of 
; messages against channel errors. 











— 
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As defined in Chapter 1, a communication channel is any medium over which infor- 
mation can be transmitted or in which information can be stored. Coaxial cable, iono- 
spheric propagation, free space, fiber optic cables, and magnetic and optical disks are 
examples of communication channels. What is common among these is that they accept 
signals at their inputs and deliver signals at their-outputs at a later time (storage case) or at 
another location (transmission case). Therefore, each communication channel is character- 
ized by a relation between its input and output. In this sense, a communication channel is 
a system. 

There are many factors that cause the output of a communication channel to be differ- 
ent from its input. These factors include attenuation, nonlinearities, bandwidth limitations, 
multipath propagation, and noise. All these factors contribute to a complex input--output 
relation in a communication channel. Due to the presence of fading and noise, the input- 
output relation in a communication channel is generally a stochastic relation. 

Channels encountered in practice are generally waveform channels that accept 
continuous-time waveforms as their inputs and produce waveforms as their outputs. 
Because the bandwidth of any practical channel is limited, by using the sampling theo- 
rem a waveform channel becomes equivalent to a discrete-time channel. In a discrete-time 
channel, both the input and the output are discrete-time signals. 

In a discrete-time channel, if the values that the input and output variables can take 
are finite, or countably infinite, the channel is called a discrete channel. An example of a 
discrete channel is a binary-input binary-output channel. In general, a discrete channel is 
defined by Æ, the input alphabet, YY, the output alphabet, and p(y|x), the conditional PMF 
of the output sequence given the input sequence. A schematic representation of a discrete 
channel is given in Figure 12.7. In general, the output y; does not only depend on the input 
at the same time x;, but also on the previous inputs (channels with intersymbol interference, 
see Chapter 9) or even on previous and future inputs (in storage channels). Therefore, a 
channel can have memory. However, if a discrete channel does not have memory, it is 
called a discrete memoryless channel or DMC. For such a channel, for any y € Y” and 
x € Æ”, we have 


e 


P(ylx) =| | Poix. (12.4.1) 
i=l 


All channel models that we will discuss in this chapter are memoryless. 


# Y 
Input alphabet Output alphabet 


p(ylx) 


Figure 12.7 A discrete channel. 
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o PO|0)=1-e 0 






P(1|0) =e 


P(O|1) =e 


1 P(il1)=1-€e 1 Figure12.8 The binary symmetric channel. 


A special case of a discrete memoryless channel is the binary symmetric channel 
or BSC. Figure 12.8 shows a binary symmetric channel. In a binary symmetric channel, 
€ = P(Oj1) = P(1(0) is called the crossover probability. 

Example 12.4.1 

Assume that we are dealing with an additive white Gaussian noise channel with binary antipo- 

dal signaling (for instance, a binary PSK modulation system). We have already seen in 


Chapters 8 and 10 that, in such a channel, the error probability of a 1 being detected as 0 
or a 0 being detected as 1 is given by 


e€ = P(1|0) = PO|l)=Q ae ; (12.4.2) 


where No is the noise power spectral density and @, denotes the energy content of each of 

the antipodal signals representing 0 and 1. This discrete channel is an example of a binary 

symmetric channel. a 
Example 12.4.2 

In an AWGN channel with binary antipodal signaling, the input is either y, or ~ 8s. 

The output is the sum of the input and the Gaussian noise. Therefore, for this binary-input, 

continuous-output channel, we have ¥ = {+,/,}, Y = R, and 





1 -Q 
f(ylx) = e w, 
v 2mo? 
where øg? denotes the variance of the noise. This an example of a discrete input—continuous 
output channel. | 


The most important continuous alphabet channel is the discrete-time, additive white 
Gaussian noise channel with an input power constraint. In this channel, both & and Y are 
the set of real numbers, and the input—output relation is given by 


Y=X44Z, (12.4.3) 


where Z denotes the channel noise, which is assumed to be Gaussian with mean equal to 0 
and variance equal to Py. We further assume that inputs to this channel satisfy some power 
constraint. For example, for large n, input blocks of length n satisfy 


1 n 
-5x <P, (12.4.4) 
n i=] 


where P is some fixed power constraint. This channel model is shown in Figure 12.9. 
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n 
Input power constraint 7 Dd x7 =P Figure 12.9 Additive white Gaussian noise 
i=l channel with power constraint. 


12.5 CHANNEL CAPACITY 


We have already seen that H (X) defines afundamental limit on the rate at which adiscrete 
source can be encoded without errors in its reconstruction. A similar “fundamental limit” 
exists also for information transmission over communication channels. 

Of course, the main objective when transmitting information over any communica- 
tion channel is reliability, which is measured by the probability of correct reception at the 
receiver. Due to the presence of noise, at first glance it seems that the error probability is 
always bounded away from zero by some positive number. However, a fundamental result 
of information theory is that reliable transmission (that is, transmission with an arbitrarily 
small error probability) is possible even over noisy channels as long as the transmission rate 
is less than some number, called the channel capacity. This remarkable result, first shown 
by Shannon (1948b), is known as the noisy channel coding theorem. What the noisy chan- 
nel coding theorem says is that the basic limitation that noise causes in a communication 
channel is not on the reliability of communication, but on the speed of communication. 

Figure 12.10 shows a discrete memoryless channel with four inputs and outputs. If 
the receiver receives an a, it does not know whether an a or a d was transmitted; if it 
receives a b, it does not know whether an a or a b was transmitted, etc. Therefore, there 
always exists a possibility of error. But if the transmitter and the receiver agree that the 
transmitter only uses the letters a and c, then there exists no ambiguity. In this case, if 
the receiver receives an a or a b, it knows that an a was transmitted; if it receives ac or 
a d, it knows that a c was transmitted. This means that the two symbols a and c can be 





Figure 12.10 An example of a discrete channel. 
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æ” = {0, 1}" ` i gy” = {0, 1}" 


PO |x) = Tl Pola) 


x: A binary sequence of length n. Figure 12.11 Thenth extension of a 


y: A binary sequence of length n. binary symmetric channel. 


transmitted over this channel with no error, i.e., we are able to avoid errors by using only 
a subset of the possible inputs to the channel. Admittedly, using a smaller subset of the 
possible inputs reduces the number of possible inputs, but this is the price that must be 
paid for reliable communication. This is the essence of the noisy channel coding theorem, 
i.e., using only those inputs whose corresponding possible outputs are disjoint, and thus do 
not cause ambiguity as to what message was transmitted. The chosen inputs should be “far 
apart” such that their “images” under the channel operation are nonoverlapping (or have 
negligible overlap). 

Looking at the binary symmetric channel and trying to apply this approach, we 
observe that there is no way that we can have nonoverlapping outputs. In fact, this is 
the case with most channels. To use the results of this argument for the binary symmet- 
ric channel, we have to apply it not to the channel itself, but to the extension channel. 
The nth extension of a channel with input and output alphabets Æ% and Y and conditional 
probabilities P(y|x) is a channel with input and output alphabets 2” and Y” and con- 
ditional probability P(y|x) = []j_, P@;:|x:). The nth extension of a binary symmetric 
channel takes binary blocks of length n as its input and its output. This channel is shown 
in Figure 12.11. By the law of large numbers, discussed in Chapter 5, if n is large enough 
and a binary sequence of length n is transmitted over the channel, the output will disagree 
with the input with high probability at ne positions. The number of possible sequences that 
disagree with a sequence of length n at ne positions is given by 


n\ _ n! 
(a) — (n —ne)\(ne)! 


Using Stirling’s approximation n ! + n”e~"./ 210, we obtain 


(i) a DH) (12.5.1) 
n 


?Recall that, in Huffman coding, we also applied the coding algorithm to the nth extension of the source 
to achieve efficient compression. 
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æ" = {0, 1}" y” = {0, 1}" 
Number of elements ~ 2”76() 
(with high probability) 






Total number of highly 


bps ts ~ 20H) Figure 12.12 Schematic representation 
probable elements ~ : 


of a BSC. 
r 


where H, (€) = —e log, € — (1 — €) log,(1 — €) is the binary entropy function as 
defined in Equation (12.1.2). This means that, for any input block, there are roughly 2”%© 
highly probable corresponding outputs. On the other hand, the total number of highly 
probable output sequences (i.e., typical output sequences) is roughly 2””“, The maxi- 
mum number of input sequences that produce almost nonoverlapping output sequences, 
therefore, is at most equal to 


2nH(Y) 


= — 9n(H(¥)—H,(€)) 
= ae) = 2 ote (12.5.2) 


This means that in n uses of this channel, we can transmit at most log, M = n(H(Y) — 
Hp(e)) bits and the transmission rate per channel use is 


log, M 
R= 22 
n 


= H(Y) — Ape). (12.5.3) 
Figure 12.12 gives a schematic representation of this case. 

Intherelation R = H(Y) — Hp(e), € depends on the channel and we cannot control 
it. However, the probability distribution of the random variable Y depends both on the 
input distribution P(x) and the channel properties characterized by €. To maximize the 
transmission rate over the channel, we have to choose a P(x) that maximizes H(Y). If X 


is a uniformly distributed random variable, such that p(X = 0) = P(X = 1) = 0.5, then 
H (Y) will be maximized at one; therefore, we obtain 


R=1- 4H,(e). (12.5.4) 


We can also prove that this rate is the maximum rate at which reliable transmission? over 
the BSC is possible. Therefore, for a BSC the capacity is given by 


C =1-—- H,(e). (12.5.5) 


A plot of the channel capacity in this case is given in Figure 12.13. It is interesting 
to note that the cases € = 0 and € = 1 both result in C = 1. This means that a channel that 


3By reliable transmission, we mean that the error probability will tend to zero as the block length n tends 
to infinity. 
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1— H,(e) 


C= 


0 Figure 12.13 The capacity 
0 01 02 03 04 05 06 07 08 09 1 €  ofaBSC. 


always flips the input is as good as the channel that transmits the input with no errors. The 
worst case, of course, happens when the channel flips the input with probability 1/2. 

The maximum rate at which we can communicate over a discrete memoryless chan- 
nel and still make the error probability approach zero as the code block length increases is 
called the channel capacity and is denoted by C. The noisy channel coding theorem stated 
next gives the capacity of a general discrete memoryless channel. 


Noisy Channel Coding Theorem. The capacity of a discrete memoryless chan- 
nel is given by 


C = max I (X; Y), l (12.5.6) 


where I (X; Y) is the mutual information between X and Y, the channel input and output, as 
defined in Equations (12.1.15) and (12.1.16). If the transmission rate R is less than C, then 
reliable communication at rate R is possible, and if R > C, then reliable communication 
at rate R is impossible. 

In this theorem, R is BiM. where M is the number of messages transmitted over the 
nth extension of the channel, and “reliable transmission” refers to transmission wherein 
the error probability can be made arbitrarily close to zero by increasing n. Both rate R and 
capacity C are measured in bits per transmission or bits per channel use. 

This theorem is one of the most important results in information theory and gives 
a fundamental limit on the possibility of reliable communication over a noisy channel. 
According to this theorem, regardless of all other properties, any communication channel is 
characterized by a number called capacity, which determines how much information can be 
transmitted over it. Therefore, to compare two channels from an information transmission 
point of view, it is enough to compare their capacities. 
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Example 12.5.1 
Find the capacity of the channel shown in Figure 12.14. 
Solution We have to find the input distribution that maximizes J (X; Y). We have 


3 


I(X; Y) = H(Y) — H(Y|X). 


But 
H(Y|X) = P(X =a)H(Y|X =a) + P(X =b)H(Y|X =b) 
+P(X = c)H(Y|X =c). 
ai the channel input-output relation, we see that for all three cases (X = a, X = b, and 


= c), Y is a ternary random variable with probabilities 0.25, 0.25, and 0.5. Theréfore, 
H(Y|X =a) = H(Y|X =b) = H(Y|X =c) = 155. 
Then, because P(X = a) + P(X =b)+ P(X = c) = 1, it follows that 
H(Y|X) = 1.5 


and 
I(X; Y) = H(Y) - 1.5. 


To maximize /(X;Y), it remains to maximize H(Y), which is maximized when Y is an 
equiprobable random variable. But it is not clear if there exists an input distribution that results 
in a uniform distribution on the output. However, in this special case (due to the symmetry of 
the channel), a uniform input distribution results in a uniform output distribution, and for this 
distribution, 


H(Y) = log3 = 1.585. 
Therefore, the capacity of this channel is given by 
C = 1.585 — 1.5 = .085 bits/transmission. 


This means that in order to transmit one bit of information, at least 


EA 


transmissions over this channel are required. a 





Figure 12.14 The DMC of Example 12.5.1. 
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12.5.1 Gaussian Channel Capacity 


A discrete-time Gaussian channel with input power constraint is characterized by the 
input—output relation 


Y=X+Z, (12.5.7) 


where Z is a zero-mean Gaussian random variable with variance Py, and for n large 
enough, an input power constraint of the form 


1 n 
-S x <P (12.5.8) 
n 

i=] 


applies to any input sequence of length n. For blocks of length n at the input, the output, 
and the noise, we have 


y=x+z. (12.5.9) 


If n is large, by the law of large numbers, we have 


1 n 1 n 
= t= sD inal s Py (12.5.10) 
i=1 i=l 
or 
|y- x? <nPy. (12.5.11) 


This means that, with probability approaching 1 (as n increases), y will be located in an 
n-dimensional sphere (hypersphere) that has a radius ./n Py and is centered at x. On the 
other hand, due to the power constraint of P on the input and the independence of the input 
and noise, the output power is the sum of the input power and the noise power, i.e., 


io : 
-y < P+ Py (12.5.12) 
n i=] 
or 
ly|? <n(P + Py). (12.5.13) 


This implies that the output sequences (again, asymptotically and with high probability) 
will be inside an n-dimensional hypersphere of radius ./n(P + Py) and centered at the 
origin. Figure 12.15 shows the sequences in the output space. 

The question now is, How many x sequences can we transmit over this channel such 
that the hyperspheres corresponding to these sequences do not overlap in the output space? 
Obviously, if this condition is satisfied, then the input sequences can be decoded reliably. 
An equivalent question is as follows: How many hyperspheres of radius y/n Py can we pack 
in a hypersphere of radius ./n(Py + P)? The answer is roughly the ratio of the volumes 
of the two hyperspheres. Note that, for an ordinary three-dimensional sphere, the volume 
is $m R?; for a two-dimensional case (circle), the “volume” (i.e., the area) is mR?. We can 
show that, in general, the volume of an n-dimensional hypersphere is proportional to R”. 
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n-Dimensional 
hypersphere 


F n-Dimensional 
radius =VnPy 


hypersphere 
radius = va (P + Py) 






Figure 12.15 The output 
sequences of a Gaussian 
channel with power constraint. 


If we denote the volume of an n-dimensional hypersphere by 
Va = Kn R”, (12.5.14) 


where R denotes the radius and K,, is independent of R, we see that the number of mes- 
sages that can be reliably transmitted over this channel is equal to i 


m = Kna(Pn + P)? 
K, (nPy)? 





= ( + =) : (12.5.15) 


Therefore, the capacity of a discrete-time, additive white Gaussian noise channel with input 
power constraint P is given by 


1 
C =~ logM 
l n P 
= —--log(1+— 
n 5 Toe ( +z) 
1 P 
= -log({1+—}. 12.5.16 
5108 ( +5) ( ) 


Bandlimited Gaussian Waveform Channels. When dealing with a continuous- 
time, bandlimited, additive white Gaussian noise channel with noise power spectral density 
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No input power constraint P, and bandwidth W, we can sample at the Nyquist rate and 


2 > 
obtain a discrete time channel. The power per sample will be P and the noise power per 
sample will be 


+W 
N 
Py = 5 af = WN. 
-W 


Substituting these results in Equation (12.5.16), we obtain 


1 P 
c= 5 log (1 + xa) bits/transmission. (12.5.17) 


If we multiply this result by the number of transmissions per second, which is 2W by the 
Nyquist sampling theorem, we obtain the channel capacity in bits/sec: 


P 
C = Wlog (1 + xa) bits/sec. (12.5.18) 


This is the celebrated Shannon’s formula for the capacity of a bandlimited additive white 
Gaussian noise channel. 


Example 12.5.2 


Find the capacity of a telephone channel with bandwidth W = 3000 Hz, and signal-to-noise 
ratio of 39 dB. 


Solution The signal-to-noise ratio of 39 dB is equivalent to 7943. Using Shannon’s relation, 
we have 


C = 3000 log(1 + 7943) ~ 38,867 bits/sec. a 


12.6 BOUNDS ON COMMUNICATION 


From the previous section, the capacity of a bandlimited additive white Gaussian noise 
channel is given by 
P 
C = Wlog (1 + sar) i 

Fromthis result, the basic factors that determine the channel capacity are the channel band- 
width W, the noise power spectrum No, and the signal power P. There exists a trade-off 
between P and W in the sense that one can compensate for the other. Increasing the input 
signal power obviously increases the channel capacity, because when we have more power 
to spend, we can choose a larger number of input levels that are far apart. Therefore, we 
can send more information bits per transmission. However, the increase in capacity as a 
function of power is logarithmic and slow. This is because if we are transmitting with a 
certain number of input levels that are A apart to allow a certain level of immunity against 
noise, and we want to increase the number of input levels, we have to introduce new levels 
with amplitudes higher than the existing levels, and this requires a lot more power. This fact 
notwithstanding, the capacity of the channel can be increased to any value by increasing 
the input power. i 
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Figure 12.16 Plot of channel 
capacity versus bandwidth. 


The effect of the channel bandwidth, however, is quite different. Increasing W has 
two contrasting effects. On one hand, on a higher bandwidth channel, we can transmit 
more samples per second and, therefore, increase the transmission rate. On the other hand, 
a higher bandwidth means higher noise power at the receiver, and this deteriorates its per- 
formance. This is seen from the two W’s that appear in the relation that describes the 
channel capacity. To see the effect of increasing the bandwidth, we let the bandwidth W 
tend to infinity. 


P P P 
lim W log(1 + ——) = — loge © 1.44—. 12.6.1 
W-> 00 a MW? No Re No ( ) 
This means that, contrary to the power case, by increasing the bandwidth alone, we cannot 
increase the capacity to any desired value. Figure 12.16 shows C plotted versus W. 
In any practical communication system, we must have R < C. If an AWGN channel 
is employed, we have 
P 
R< WI 1+ —— }. 12.6.2 
T azsa 
By dividing both sides by W and defining r = z, the spectral bit rate (or bandwidth 
efficiency), we obtain 
P 
r < log ( + z) (12.6.3) 
If €, is the energy per bit,* then 6, = 2, By substituting in the previous relation, we 
obtain 


r < log (1 + re) (12.6.4) 
0 


“Note that P = energy/sec and R = bits/sec; hence, P/R = energy/bit = E. 
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Figure 12.17 Spectral bitrate versus SNR/bit in an optimal system. 
or, equivalently, 
Sp 2-1 
— 12.6.5 
N, 0 r ( ) 


This relation is plotted in Figure 12.17. This equation gives the relation between two impor- 
tant parameters in a communication system. These parameters are r, the spectral bit rate, 
which is a measure of the bandwidth efficiency of a communication system, and Sy the 
SNR/bit, which is a measure of the power efficiency of a system. With a higher value of 
r, the system is more bandwidth efficient. With a lower value of Ne , required to achieve a 


certain error probability, the system is more power efficient. 
The curve defined by 


8 $ 2-1 
r = log (1 + re) o = (12.6.6) 





divides the plane into two regions. In one region (below the curve), reliable communication 
is possible; in the other region (above the curve), reliable communication is not possible. 
The performance of any communication system can be denoted by a point in this plane; 
the closer the point is to this curve, the closer the performance of the system is to that of 
an optimal system. From this curve, we see that as r tends to 0, 

E 

— = n2 = 0.693 ~ —1.6 dB (12.6.7) 

No 
is an absolute minimum for reliable communication. In other words, for reliable commu- 
nication, we must have 


— > 0.693. (12.6.8) 
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In Figure 12.17, when r < 1, we are dealing with a case where bandwidth is large and the 
main concern is limitation on power. This case is usually referred to as the power-limited 
case. Signaling schemes, with high dimensionality, such as orthogonal, biorthogonal, and 
simplex schemes can be used in these cases. The case where r >> 1 happens when the 
bandwidth of the channel is small; therefore, it is referred to as the bandwidth-limited 
case. Low-dimensional signaling schemes with crowded constellations (PAM, QAM, and 
PSK) are implemented in these cases. 


The Information Transmission Theorem. In Section 12.2, we introduced the 
fundamental limit in coding of information sources. This fundamental limit is expressed in 
terms of the entropy of the source. Entropy gives a lower bound on the rate of the codes 
that are capable of reproducing the source with no error. If we want to transmit a source U 
reliably via a channel with capacity C, we require that 


H(U) <C. (12.6.9) 


This relation defines the fundamental limit on the transmission of information. Note 
that we have assumed that for each source output, one transmission over the channel is 
possible. 


Example 12.6.1 


A binary source with P(X = 0) = 5 is to be transmitted over a binary symmetric channel 
with a crossover probability of €. Determine the range of € for reliable communication of the 
source. Assume that the channel can be used once per source output. 


Solution We have 


1 1 3 3 
H(U) = -7 loge m 7 108 Te 0.8113. 


For reliable communication, we need H(U) < C. The channel capacity fora BSC is given by 
C = 1 — H,(e). Therefore, we must have 


0.8113 < 1— Hp (e) 


or 
Hi, (€) < 0.1887. 


This is a nonlinear equation and should be solved by numerical methods. This results in 
H,(0.0288) ~ 0.1887 and two acceptable regions for e, namely, O < € < 0.0288 and 
0.9712 <e <1. a 


12.7 SUMMARY AND FURTHER READING 


In this chapter we introduced mathematical models for information sources and introduced 
a measure of information for discrete random variables. This measure of information, 
expressed as entropy, or entropy rate for sources with memory, is based on the works of 
Nyquist, Hartly, and Shannon. We also introduced joint entropy, conditional entropy, and 
mutual information of two random variables. Shannon’s source coding theorem as well 
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pas Huffman coding and Lempel-Ziv algorithm for lossless data compression were subse- 
quently introduced and we saw that entropy (or entropy rate) is the fundamental limit on 
how much a source can be losslessly compressed. 

Then, we studied mathematical models for communication channels and introduced 
important channels models including the DMC, the BSC, the discrete-time AWGN channel 
with input power constraint, the binary-input AWGN channel, and the continuous-time 
bandlimited Gaussian waveform channel. We introduced the important notion of channel 
capacity and stated Shannon’s noisy channel coding theorem. The chapter concluded with 

„a discussion of the information transmission theorem. 

There exist many excellent books on information theory. The interested reader is 
referred to the books by Gallager (1968), Blahut (1987), and Cover and Thomas (2006), as 
well as Shannon’s original work Shannon (1948a, 1948b). 


PROBLEMS 


12.1 A discrete memoryless source has an alphabet {a), a2, a3, a4, as, ag} with corre- 
sponding probabilities {0.1, 0.2, 0.3, 0.05, 0.15, 0.2}. Find the entropy of this source. 
Compare this entropy with the entropy of a uniformly distributed source with the 
same alphabet. 


12.2 Let random variable X be the output of a discrete memoryless source that is uni- 
formly distributed with size N. Find the entropy of it. 


12.3 Show that H(X) > 0 with equality holding if and only if X is deterministic. _ 


12.4 Let X be a geometrically distributed random variable; i.e., 
P(X =k)=p(l -p k=1,2,3,.... 


1. Find the entropy of X. 
2. Knowing that X > K, where K is a positive integer, what is the entropy of X? 


12.5 Let Y = g(X), where g denotes a deterministic function. Show that, in general, 
H(Y) < H(X). When does equality hold? 


12.6 An information source can be modeled as a bandlimited process with a bandwidth 
of 6000 Hz. This process is sampled at a rate higher than the Nyquist rate to provide 
a guard band of 2000 Hz. We observe that the resulting samples take values in the 
set H = {—4, —3, —1, 2, 4, 7} with probabilities 0.2, 0.1, 0.15, 0.05, 0.3, 0.2. What 
is the entropy of the discrete time source in bits per output (sample)? What is the 
information generated by this source in bits per second? 


12.7 Let X denote a random variable distributed on the set 4 = {a1, a2, ..., an} with cor- 
responding probabilities {p1, po, ..., Pw}. Let Y be another random variable defined 
on the same set, but distributed uniformly. Show that 


H(X) < H(Y) 
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with equality if and only if X is also uniformly distributed. Hint: First prove the 

inequality Inx < x — 1 with equality only for x = 1, then apply this inequality to 
1 

pa Pn In (z). 

A random variable X is distributed on the set of all positive integers 1, 2, 3, ... with 


corresponding probabilities p1, p2, p3, .... We further know that the expected value 
of this random variable is given to be m, i.e., 


œœ 
X ipi =m. 
i=l 


Show that, among all random variables that satisfy the preceding condition, the geo- 
metric random variable that is defined by 


1 Py 
p= (1--) b= 13.23 334.5 
m m 


has the highest entropy. Hint: Define two distributions on the source, the first one 
being the foregoing geometric distribution and the second one an arbitrary distribu- 
tion denoted by q;. Then apply the approach of Problem 12.7. 


A memoryless source has the alphabet A = {—5, —3, —1, 0, 1, 3, 5} with corre- 
sponding probabilities {0.05, 0.1, 0.1, 0.15, 0.05, 0.25, 0.3}. 


1. Find the entropy of the source. 


2. Assume that the source is quantized according to the quantization rule 


q(—S5) = q(-3) = —4, 
q(—1) = 40) =4(1) = 0. 
q3) =46) =4 


Find the entropy of the quantized source. 


Using the two definitions of the entropy rate of a random process given in 
Equations (12.1.13) and (12.1.14), prove that for a DMS the entropy rate and the 
entropy are equal. 


A Markov process is a process with one step memory, i.e., a process such that 


D(Xnlxn-1, Xn—2,Xn—3,--.) = P(Xn|Xn—1) 


` for all n. Show that, for a stationary Markov process, the entropy rate is given by 


H (Xn|Xn-1). 


Using Equation 12.1.7, show that 


H(XIY) = È pPO)H(XIY = y). 
» 
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12.13 Let X and Y denote two jointly distributed discrete valued random variables. 


1. Show that 
H(X) = — È p(x, y) log px) 
x,y 


and 
HY) =-}_ p(x, y) log pO). 


x,y 
2. Use this result to show that 
H(X, Y) < H(X) + H(Y). 


When does the equality hold? Hint: Consider the two distributions p(x, y) and 
P(x) p(y) on the product set & x Y, and apply the inequality proved in Problem 
12.7 to > `x, y P(x, y) log rO), 


12.14 Use the result of Problem 12.13 to show that 
H(X|Y) < H(X) 
with equality if and only if X and Y are independent. 


12.15 Show that, in general, 


n 
H(X1,Xo,...,Xn) < X H(Xi). 


i=l 
When does the equality hold? 


12.16 Assume that a BSS generates a sequence of n outputs. 


1. What is the probability that this sequence consists of all zeros? 
2. What is the probability that this sequence consists of all ones? 


3. What is the probability that, in this sequence, the first k symbols are ones and 
the next n — k symbols are zeros? 

4. What is the probability that this sequence has k ones and n — k zeros? 

5. How would your answers change if, instead of a BSS, we were dealing with a 
general binary DMS with P(X; = 1) = p. 


12.17 Give an estimate of the number of binary sequences of length 10,000 with 3000 
zeros and 7000 ones. 


12.18 A memoryless ternary source with output alphabet a), a2, and a3 and corresponding 
probabilities 0.2, 0.3, and 0.5 produces sequences of length 1000. 


1. What is the approximate number of typical sequences in the source output? 


2. What is the ratio of typical sequences to nontypical sequences? 
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3. What is the probability of a typical sequence? 

4. What is the number of bits required to represent all output sequences? 

5. What is the number of bits required to represent only the typical output 
sequences? 

6. What is the most probable sequence and what is its probability? 

7. Is the most probable sequence a typical sequence? 


A source has an alphabet {a1, a2, a3, a4} with corresponding probabilities {0.1, 
0.2, 0.3, 0.4}. 


1. Find the entropy of the source. 


2. Whatis the minimum required average code word length to represent this source 
for error-free reconstruction? 


3. Design a Huffman code for the source and compare the average length of the 
Huffman code with the entropy of the source. 

4. Design a Huffman code for the second extension of the source (take two letters 
at a time). What is the average code word length? What is the average number 
of required binary letters per each source output letter? 

5. Which is a more efficient coding scheme: the Huffman coding of the original 
source or the Huffman coding of the second extension of the source? 


Design a Mutat code for a source with n output letters and corresponding proba- 
bilities {354 a i, ae ae sat}. Show that the average code word length for such a 
source is equal to the source entropy. 


Show that {01, 100, 101, 1110, 1111, 0011, 0001} cannot be a Huffman code for 
any source probability distribution. 


Design a ternary Huffman code, using 0, 1, and 2 as letters, for a source with output 
alphabet probabilities given by {0.05, 0.1, 0.15, 0.17, 0.18, 0.22, 0.13}. What is the 
resulting average code word length? Compare the average code word length with 
the entropy of the source. (In what base would you compute the logarithms in the 
expression for the entropy for a meaningful comparison?) 


Design a ternary Huffman code for a source with output alphabet probabilities given 
by {0.05, 0.1, 0.15, 0.17, 0.13, 0.4}. Hint: You can add a dummy source output with 
zero probability. 


A discrete memoryless source X has the alphabet {—5, —3, —1, 0, 1, 2, 3} with the 
corresponding probabilities {0.08, 0.2, 0.12, 0.15, 0.03, 0.02, 0.4}. 


1. Design a Huffman code for this source and find R, the average code word length 
of the Huffman code. 


2. Determine the entropy of the source and, using the result of Part 1, determine 
the efficiency of the Huffman code you designed. The efficiency is defined as 


T= 
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Now quantize the source using the following quantization rule: 


2, X=-5,-3 
X¥={0, xX=-1,0,1 
2 X=2,3 


What is the absolute minimum required rate (bits per symbol) for perfect recon- 
struction of X? 


. Looking at sequences of length 10,000 at the output of the quantized source 


(i.e., X), what is the possible number of sequences? What is the number of typ- 
ical sequences? 


. If a Huffman code is designed for the second extension of the original source 


before quantization (i.e., taking two letters at a time), what can you say about 
the average code word length per individual source symbol? Find bounds on the 
average code word length. 


A discrete memoryless information source is described by the alphabet # = {x1, x2, 
x3, X4, X5, X6} with probabilities {1/32, 1/8, 1/2, 1/16, 1/32, 1/4}, respectively. 


1. 


Design a Huffman code for this source and determine the average code word 
length of the Huffman code. 


. Can you improve the Huffman code by encoding the second extension of this 


source (in other words, using two letters at a time and designing the Huffman 
code for that source)? Why? 


. Is there any way to improve the performance of the Huffman code designed in - 


Part 1? (By improving the performance, we mean designing a code with a lower 
average code word length.) 


A discrete memoryless information source has the alphabet {a1 , a2, a3, a4, as} with 
the corresponding probabilities {0.1, 0.2, 0.05, 0.3, 0.35}. 


1. 


Can this source be compressed at a rate of 2 bits per source symbol such that 
lossless reconstruction of it is possible? 


. Now consider all sequences of length 1000 that this source can generate. How 


many of these sequences are possible? Write your answer in exponential form. 


. Approximately how many of the sequences in Part 2 are typical sequences? 


Write your answer in exponential form. 


. Now assume that you want to ‘merge two letters of the source into one new letter 


b, say, b = {a;, aj}, such that the resulting source (which now has four outputs 
instead of five) can be compressed at a rate of 1.5 bits per symbol and can be 
recovered with no loss. Which two letters would you merge into the new letter b? 


A discrete memoryless source with output alphabet {a Dl and corresponding prob- 
abilities {0.11, 0.18, 0.1, 0.2, 0.25, 0.05, 0.11} is to be transmitted with no errors to 
a destination. 
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1. What is the minimum rate required for transmission of this source? 


2. Design a Huffman code for this source. What is the average code word length of 
the Huffman code? How is this average code word length related to A answer 
in Part 1? 


3. A new source is obtained by grouping the outputs of this source as follows: 
bı = {a1, a2}, b2 = {a3, a4}, b3 = {a5, a6}, b4 = {a7}. Answer the question in 
Part 1 for this new source. 


Find the Lempel—Ziv source code for the binary source sequence 


00010010000001 10000100000001000000101000010000001 10100000001 100. 


Recover the original sequence from the Lempel—Ziv source code. Hint: You require 
two passes of the binary sequence to decide on the size of the dictionary. 


Using the definition of H (X) and H(X|Y), show that 


p(x, y) 


X; 
I(X; Y) = Le DDE pO) 


Now, by using the approach of Problem 12.7, show that 7 (X; Y) > 0 with equality 
if and only if X and Y are independent. 
Show that 


1. 1(X; Y) < min{H(X), H(Y)}. 


2. If |#| and |Y| represent the size of sets # and Y, respectively, then 7 (X; Y) < 
min{log |Æ], log |X|}. 


Show that /(X; Y) = H(X) + H(Y) — H(X, Y) = A(Y) — H(Y|X) = I (Y; X). 


Let X denote a binary random variable with P(X = 0) = 1 — P(X = 1) = p, and 
let Y be a binary random variable that depends on X through P(Y = 1|X = 0) = 
P(Y =0|X = 1) =e. 


1. Find H(X), H(Y), H(Y|X), H(X, Y), H(X|Y), and I (X; Y). 
2. For a fixed €, which p maximizes I (X; Y)? 
3. For a fixed p, which € minimizes I (X; Y)? 


Show that 
1(X; YZW) = I(X; Y) + I(X: ZY) + 1(X; WIZY). 
Can you interpret this relation? 


Let X, Y, and Z be three discrete random variables. 
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1. Show that if p(x, y, z) = p(z)p(xlz) p(yI|x), we have 
I(X; Y|Z) < I(X;Y). 

2. Show that if p(x, y, z) = p(x)p(y)p(z|x, y), then 
I(X; Y) < I (X; Y|Z). 

3. In each case, give an example where strict inequality holds. 


12.35 Find the capacity of the channel shown in Figure P-12.35. 





Figure P-12.35 


12.36 The channel shown in Figure P-12.36 is known as the binary erasure channel. Find 
the capacity of this channel and plot it as a function of e€. 


0 ene 0 
€ 
E 
€ 
1 1 
l-e Figure P-12.36 


12.37 Find the capacity of the cascade connection of n binary symmetric channels with 
the same crossover probability e. What is the capacity when the number of channels 
goes to infinity? 


12.38 Using Stirling’s approximation n! ~ n"e~"./21n, show that 


aan eee 2n Hele). 
nE 
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12.39 Show that the capacity of a binary-input, continuous-output AWGN channel with 
inputs +A and noise variance o? (see Example 12.4.2) is given by 


e= (8) 


where 
cam | -(u-x)?/2 2 d 
f(x) = iP ana logy Tew u. 


12.40 The matrix whose elements are the transition probabilities of a channel, i.e. 
p(vilx;)’s, is called the channel probability transition matrix. A channel is called 
symmetric if all rows of the channel probability transition matrix are permutations 
of each other, and all its columns are also permutations of each other. Show that in 
a symmetric channel, the input probability distribution that achieves capacity is a 
uniform distribution. What is the capacity of this channel? 


12.41 Channels 1, 2, and 3 are shown in Figure P-12.41. 


OQ a 4 0 ———_—————qcccmoec 
0.5 
1 
._—————c—“—“> b 1 oe d 
Channel 1 Channel 2 
a 
0.5 





Channel 3 


Figure P-12.41 


1. Find the capacity of channel 1. What input distribution achieves capacity? 


2. Find the capacity of channel 2. What input distribution achieves capacity? 
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3. Let C denote the capacity of the third channel, and let Cı and C, represent the 
capacities of the first and second channel. Which of the following relations holds 
true and why? 

(a) C < 3(C, + C2). 
(b) C = ¿(C1 + C). 
(c) C> 4(C1 +02). 
12.42 Let C denote the capacity of a discrete memoryless channel with input alphabet 


X = {x1,x2,...,xy} and output alphabet Y = {y1, y2, ..., Ym}. Show that C < 
min{log M, log N}. 


12.43 The channel C is (known as the Z channel) shown in Figure P-12.43. 


1. Find the input probability distribution that achieves capacity. 


2. What is the input distribution and capacity for the special cases € = 0, € = 1, 
and € = 0.5? 


3. Show that ifm such channels are cascaded, the resulting channel will be equiva- 
lent to a Z channel with e; = e”. 


4. What is the capacity of the equivalent Z channel when n —> oo? 


0 > 0 


1 1 
€ Figure P-12.43 


12.44 Find the capacity of the Channels A and B, as shown in Figure P-12.44. What is the 
capacity of the cascade channel AB? 


woe + a ean a nn eee 


b! Ib 
0.5 ! i 
05 2a 
C ! i c 
a eines E n Ra hae eS NA ae oe AA 1 Wo Se a ae ee Be 
Channel A Channel B 


Figure P-12.44 
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12.45 Find the capacity of an additive white Gaussian noise channel with a bandwidth of 
1 MHz, power of 10 Watts, and noise power spectral density of Mo = 10°? Wz, 


12.46 Channel C; is an additive white Gaussian noise channel with a bandwidth of W, 
transmitter power of P, and noise power spectral density of ™, Channel C3 is an 
additive Gaussian noise channel with the same bandwidth and power as channel C, 
but with noise power spectral density S„( f). We assume that the total noise power 


for both channels is the same; that is, 


WwW WwW No 
f sunvar = [par = Now. 


Which channel has a larger capacity? Give an intuitive reasoning. 


12.47 For the channel shown in Figure P-12.47, find the channel capacity and the input 
distribution that achieves capacity. 





2 Figure P-12.47 


12.48 Consider two discrete memoryless channels represented by (2 , p(yi|x1), Yı) and 
(42, p(y2|x2), Y2) with corresponding capacities Cı and C2. A new channel is 
defined by (®, x 22, p(yilx1) p(v2lx2), Yı x Y2). This channel is the “product 
channel” and models the case where xı € 2 and x2 € 22 are simultaneously trans- 
mitted over the two channels with no interference. Prove that the capacity of this 
channel is the sum of Cı and C3. 


12.49 Let (Xi, p(y; |x1), Yı) and (42, p(y2lx2), Y2) represent two discrete memoryless 
communication channels with inputs #;, outputs Y; and conditional probabilities 
p(y;\x;). Further assume that 2% 12%. = Ø and Yı MU. = Ø. We define the 
sum of these channels as a new channel with the input alphabet 2 U 22, output 
alphabet Yı U Y2, and conditional probability p(y;|x;) where i denotes the index 
of Æ to which the input to the channel belongs. This models a communication situ- 
ation where we have two channels in parallel, and at each transmission interval, we 
can use only one of the channels, the input and output alphabets are disjoint, and 
therefore at the receiver there is no ambiguity regarding which channel was being 
used. 


1. Show that the capacity of this channel satisfies 2° = 2“ + 22, where C, and 
Cı are the capacities of each channel. 
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2. Using the result of Part 1, show that if Ci = C2 = 0, we still have C = 1. In 
other words, show that we are able to transmit one bit per transmission using 
two channels with zero capacity. How do you interpret this result? 


3. Find the capacity of the channel shown in Figure P-12.49. 


2 2 Figure P-12.49 


12.50 X is a binary memoryless source with P(X = 0) = 0.3. This source output is 
transmitted over a binary symmetric channel with crossover probability € = 0.1. 


1. Assume that the source is directly connected to the channel, i.e., no coding is 
employed. What is the error probability at the destination? 
2. For what values of € is reliable transmission possible (with coding, of course)? 


COMPUTER PROBLEMS 


12.1 Huffman Coding 


The objective of this problem is to design a Huffman code using MATLAB. The dis- 
crete memoryless source output is generated from an alphabet Æ = {x1, x2,..., x9} 
with corresponding probabilities 


p = {0.2, 0.15, 0.13, 0.12, 0.1, 0.09, 0.08, 0.07, 0.06}. 
1. Design a Huffman code and sketch the corresponding code tree. Specify the code 
words for the nine symbols in the alphabet. 
2. Determine the average code word length of the Huffman code. 
3. Determine the entropy of the source and compare it with the average code word 
length of the Huffman code. 
12.2 Huffman Coding 


The objective of this problem is to design Huffman codes using MATLAB. The dis- 
crete memoryless source output is generated from the alphabet @ = {x1, x2,..., x6} 
with the corresponding probabilities 


p = {0.1, 0.3, 0.05, 0.09, 0.21, 0.25}. 
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1. Determine the entropy of the source. 


2. Design a Huffman code and sketch the corresponding code tree. Specify the code 
words for the six symbols in the alphabet. 


3. Determine the efficiency of the Huffman code designed in Part 2. 


4. Design a Huffman code for the source sequences of length 2 and sketch the code 
word tree. Specify the code words for the symbols of length 2 and determine the 
efficiency of the code. Compare the efficiency for length 2 sequences with that 
for length 1. 


12.3 Huffman Coding 


A discrete memoryless source is generated from the alphabet Æ = {x1, x2,..., Xo} 
with corresponding probabilities 


11ll 1 1 1 1 1 
P= 5555995 a o Gee 


1. Design a Huffman code and sketch the corresponding code tree. Specify the code 
words for the nine symbols in the alphabet. 


2. Determine the average code word length of the Huffman code, the entropy of 
the source, and the efficiency of the Huffman code. 


3. Under what conditions is the efficiency of the Huffman code equal to 1? 


12.4 Huffman Code for Printed English 


The probabilities of the letters of the alphabet occurring in printed English are given 
in the following table: 


Letter Probability Letter Probability Letter Probability 


A 0.0642 B 0.0127 C 0.0218 
D 0.0317 E 0.1031 F 0.0208 
G 0.0152 H 0.0467 I 0.0575 
J 0.0008 K 0.0049 L 0.0321 
M 0.0198 N 0.0574 O 0.0632 
P 0.0152 Q 0.0008 R 0.0484 
S 0.0514 T 0.0796 U 0.0228 
Vv 0.0083 W 0.0175 X 0.0013 
Y 0.0164 Z 0.0005 Space 0.1859 


1. Determine the entropy of printed English. 
2. Design a Huffman code for printed English. 


3. Determine the average code word length and the efficiency of the Huffman code. 
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12.5 Capacity of Binary Symmetric Channel 


Binary data are transmitted over an additive white Gaussian noise channel using 
BPSK signaling and hard-decision decoding at the output using optimal matched 
filter detection. 


1. Plot the error probability of the channel as a function of 
—, (12.7.1) 


where © is the energy in each BPSK signal and No/2 is the noise power spectral 
density. Assume that y changes from —20 to 20 dB. 


2. Plot the capacity of the resulting channel as a function fo y. 


12.6 Capacity of Binary Input AWGN Channel 


A binary input additive white Gaussian noise channel is modeled by the two binary 
input levels A and —A and additive zero-mean Gaussian noise with variance o°. In 
this case & = {—A, A}, VY = R, p(y|X = A) ~ N(A, 0%), and p(y|X = —A) ~ 
N(—A, o°). Plot the capacity of this channel as a function of A/o. Hint: Use the 
result of Problem 12.39. 


12.7 Capacity of a Bandlimited Additive White Gaussian Noise Channel 


The objective of this problem is to compute the capacity of a bandlimited additive 
white Gaussian noise channel using MATLAB. The capacity of this channel is given 


by the formula 
P 
C= W log, (1+ zr) , 


where W is the bandwidth of the channel, P is the average transmitted power, and 
No/2 is the power spectral density of the additive white Gaussian noise. 


1. Plot the capacity of the channel whose bandwidth W = 3000 Hz as a function 
of P/No, for values of P /No between —20 and 30 dB. 
2. Plot the capacity of the channel as a function of W when P/No = 25 dB. What 
is the limiting value of the channel capacity as W —> oo? 
12.8 Capacity of a Bandlimited Additive White Gaussian Noise Channel 
The capacity of an AWGN channel, given as 
C=WI1 1+ £ 
= o — 
82 MW 


can also be expressed in terms of ,/No, which is the SNR/bit that determines the 
probability of error. Since the average power P = €,/T, = C%,, where 1/7, = C 
is the rate in bits per second, this channel capacity formula may be expressed as 
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If we solve for ,/No as a function of C/W, we obtain 


€,  Ww-—1 





1. Plot the normalized capacity C/ W as a function of 6,/ No for the range of values 


0.1 < C/W < 10. It may be convenient to plot the graph as a function of the 
SNR/bit in dB (10 log;g €,/No). 


2. Determine the SNR/bit in the limitas C/W — 0. 
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In Chapter 12, we introduced fundamental bounds on source coding (data compression) 
and data transmission through a noisy channel. We saw that the fundamental bound for 
error-free source compression is given by the entropy (or entropy rate, for a source with 
memory) and the fundamental limit for reliable communication is given by the channel 
capacity. We also introduced some algorithms for source coding that achieved the bound 
predicted by theory. In this chapter, we present methods for channel coding in pursuit 
of achieving the bounds set by theory in Shannon’s noisy channel coding theorem, i.e., 
the channel capacity. It turns out that achieving channel capacity is more difficult than 
designing good source codes. 


13.1 THE PROMISE OF CODING 


We begin this section with an example that shows how coding can help us achieve lower 
error probabilities in digital communications. 


Example 13.1.1 


In a digital communication system, the transmitter power is P and the rate of the source is R. 
The system employs an M = 4 PSK signaling (QPSK), where a pair of information bits are 
mapped into any of the four signals shown in the constellation depicted in Figure 13.1. We 
can readily see that 6, = z, and the minimum Euclidean distance between any two signals in 
this constellation is given by 


P 
dan = 46e = 4. (13.1.1) 


Now assume that instead of transmitting a QPSK signal (which is two dimensional), three 
orthonormal signals are employed to transmit the same two bits. For example, we can assume 
that the orthonormal signals are given by Y (t), Y(t — T), and w(t — 2T), where ẹ (t) is equal 
to zero outside the interval [0, T] and 


T 
f w(t)dt =1. (13.1.2) 
0 
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Figure 13.1 Signal constellation for a 4PSK scheme. 





Figure 13.2 Code words on the vertices of a cube. 


This is obviously a set of orthonormal signals with a dimensionality of 3. Using these orthonor- 
mal basis signals, we construct the following four signals: 


s(t) = VE (yA + Wt -T) 4+ -27)); (13.1.3) 
x(t) = VE (+t) — W(t T) — W(t — 2T)); (13.1.4) 
st) = VE (-¥@) — VE- T) + Wt —27)); (13.1.5) 
sat) = VE (vA) + yt -— T) — W(t -27)) (13.1.6) 
or, equivalently, in vector notation, 
sı = V6 (41, 41, +1); (13.1.7) 
s2 = VE (41, -1, —1); (13.1.8) 
s3 = v6 (—1, —1, +1); (13.1.9) 
s4 = VE (-1, +1, —1). (13.1.10) 


The corresponding constellation is shown in the three-dimensional space in Figure 13.2. Obvi- 
ously, with this choice of code words, each code word differs from any other code word at two 
components. Therefore, the Euclidean distance between any two signals is given by 


d, = |si- sj? =88 fori  j. (13.1.11) 
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The energy © is easily related to the energy per bit ,. Since two bits are transmitted per 


waveform, 
26, = 38; (13.1.12) 
hence, 
2 2P 
$= -G = =-. A. 
RH ae (13.1.13) 
Therefore, by substituting the result in Equation (13.1.13) into Equation (13.1.11), we obtain 
16 P 
2 os ES . . 
d; = 3 R fori Æ j. (13.1.14) 


Comparing this with the minimum distance in the four-PSK signal, we observe that the 
minimum-distance squared has increased by a factor of 





a we 4 
= = a4 = —. (13.1.15) 
dss 4R 3 


Because the error probability is a decreasing function of the minimum Euclidean distance, we 
have reduced the error probability by employing this new scheme. In fact, we can say that 
the resulting reduction in error probability is equivalent to the reduction in error probability 
achieved by an increase in power by a factor of 4. This, in turn, is equivalent to 1.25 dB 
power gain. This power gain, of course, has not been obtained for free. We see that with this 
signaling scheme in a time duration of Z, which is the time duration to transmit two bits, we 
have to transmit three signals. Therefore, the width of these signals is reduced by a factor of 2, 
and the bandwidth required to transmit them is increased by a factor of 2. A second problem 
with this scheme is that obviously it is more elaborate and requires a more complex decoding 
scheme. E 


The foregoing example basically describes what a coding scheme does. Coding 
results in a lower error probability [which is equivalent to a higher effective signal-to-noise 
ratio (SNR)] at the price of increasing the bandwidth! and the complexity of the system. 
In the preceding exercise, we have increased the number of dimensions from 2 to 3 (from 
QPSK to a three-dimensional signaling). This is equivalent to the following mapping: 


(+1, +1) > (41, +1, +1), 
(+1, -1) > (+1, —1, -1), 
(-1,-1) => (—1, —1, +1), 
(—1, +1) -> (-1, +1, —1). 


As seen from this mapping, the role of coding has been to add a parity-check bit to the 
two information bits. The parity is added in such a way that the number of +1’s in the 
resulting code word is always an odd number (or, equivalently, the number of —1’s is an 
even number). 


'There exist coding modulation schemes that increase the Euclidean distance between code words, but do 
not increase the bandwidth. 
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In a general signaling scheme with coded waveforms, sequences of length k = RT 
of the source outputs are mapped into sequences of length n of the form 


si =v (£1, +1,---,+1). (13.1.16) 
— mamam 


n 


These points are located on the vertices of a hypercube of edge length 24/8. The ratio 


Ro = ua (13.1.17) 
n 

is defined to be the code rate. There exist a total of 2” vertices of an n-dimensional hyper- 
cube, of which we have to choose M = 2* as code words. Obviously, we have to select 
these 2* vertices so that they are as far apart from each other as possible. This makes 
the Euclidean distance between them large and, thus, reduces the error probability. In 
Example 13.1.1, we have k = 2 and n = 3. We chose 2% = 4 points from the possible 
23 = 8 vertices of the three-dimensional cube so that they were maximally apart. The rate 
of the resulting code is Rp = 2/3. 

Assume that we have chosen 2* vertices of the hypercube as the code words and each 
code word differs from another code word in at least a. components. This parameter is 
called the minimum Hamming distance of the code and will be defined more precisely 
in Section 13.2. The relation between Euclidean distance and Hamming distance is very 
simple. If the sequences s; and s; differ in dj} locations, then their Euclidean distance dÈ 
is related to d} by 


(E= > (=2v8) = 4d. (13.1.18) 
l<l<n 


«st gl 
is; #8; 


This means that the minimum Euclidean distance can be expressed in terms of the mini- 
mum Hamming distance as 


(dE) = 4dh p8. (13.1.19) 


Now if we assume that s; is transmitted and use the union bound (see Section 8.4.2), the 
probability of a code word error is upperbounded as 


4d € 
Py, <MQ Ne 
0 


z Sb Mo (13.1.20) 


where in the last step we have used the bound on the Q function introduced in Chapter 5. 
Noting that the energy content of each code word is nê and hasto be equalto PT, we have 


PEELA ana E A (13.1.21) 
n n n 
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where we have used the relation 6, = 2, Hence, 


M 
Py < 5 ente6, (13.1.22) 
(The index i has been deleted because the bound is independent of i.) If no coding were 
employed—that is, if we used all the vertices of a k-dimensional hypercube rather than 2* 
vertices of an n-dimensional hypercube—we would have the following union bound on the 
code word error probability: 


p, <mol [2 
M No 
M 
< oe (13.1.23) 


Comparing the two bounds, we conclude that coding has resulted in a power gain equiva- 
lent to 


Gooding 0, hes (13.1.24) 


which is called the asymptotic coding gain, or simply, the coding gain. As seen here, the 
coding gain is a function of two main code parameters, the minimum Hamming distance 
and the code rate. Note that, in general, Re < 1 and au. > 1; therefore, the coding gain 
can be greater or less than 1. It turns out that there exist many codes that can provide good 
coding gains. The relation defining the coding gain once again emphasizes that, for a given 
n and k, the best code is the code that can provide the highest minimum Hamming distance. 

To study the bandwidth requirements of coding, we observe that when no coding is 
used, the width of the pulses employed to transmit one bit is given by 


T, = —. 13.1.25 
b= k ( ) 


After using coding, in the same time duration that k pulses were to be transmitted, we must 


now transmit n pulses. This means that the duration of each pulse is reduced by a factor of 
k 


n = Re. Therefore, the bandwidth expansion ratio is given by 


Weed 1 
pao ea 2 et (13.1.26) 
Who coding Re k 


We can prove that in an additive white Gaussian noise (AWGN) channel, there exists a 


sequence of codes with parameters (n;, k;) with fixed rate e = R, independent of i) 
satisfying 
R< li 1+ £ (13.1.27) 
ES OBAS E MWA a 


where ; log (1 + wir) is the capacity of the channel in bits per transmission, for which 
the error probability goes to zero as n; becomes larger and larger. 

In this chapter, we study two major types of codes, block codes and convolutional 
codes. Block codes are the codes that we have already described. In a block code, the 
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kL 














Figure 13.3 A convolutional encoder. 


information sequence is broken into blocks of length k and each block is mapped into 
channel input blocks of length n. This mapping is independent from the previous blocks, 
i.e., there is no memory from one block to another block. In convolutional codes, we use 
a shift register of length kgL, as shown in Figure 13.3. The information bits enter the shift 
register ko bits at a time; then nọ bits that are linear combinations of various shift register 
bits, are transmitted over the channel. These no bits depend not only on the recent ko bits 
that just entered the shift register, but also on the (L — 1)ko previous contents of the shift 
register that constitute its state. The quantity 


m=L . (13.1.28) 


is defined as the constraint length of the convolutional code, and the number of states of 
the convolutional code is equal to 2‘4~", The rate of a convolutional code is defined as 


Ras (13.1.29) 


no 


The main difference between block codes and convolutional codes is the existence of mem- 
ory in convolutional codes. 


13.2 LINEAR BLOCK CODES 


An (n, k) block code is a collection of M = 2k binary sequences, each of length n, called 
code words. A code & consists of M code words c; for 1 <i < 2%; i.e., 


€ = {ce}, C2,°°° CM}, 


where each c; is a sequence of length n with components equal to 0 or 1. The collection of 
the code words is called the codebook or, simply, the code. 
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Definition 13.2.1. A block code is linear if the modulo-2 sum of any two code 
words is also a code word. This requires that if c; and cj are code words, then c; ® cj must 
also be a code word, where © denotes componentwise modulo-2 addition. a 


With this definition, we can readily see that a linear block code is a k-dimensional 
subspace of an n-dimensional space. It is also obvious that the all-zero sequence 0 is a code 
word of any linear block code, since it can be written as c; ® c; for any code word c;. We 
further assume that if the information sequence x, (of length k) is mapped into the code 
word c, (of length n) and the information sequence x2 is mapped into ¢2, then x; © x2 is 
mapped into cı @ c2. 


Example 13.2.1 
A (5, 2) code is defined by the code words 


@, = {00000, 10100, 01111, 11011}, 
where the mapping of information bits to code words is as follows: 
00 — 00000 
01 > 01111 
10 > 10100 
11 > 11011. 
It is easy to verify that this code is linear. However, the code defined by 
€2 = {00000, 11100, 01111, 11011} 
is not linear because the sum of the second and the third code words is nota code word. m 


Now we will define some of the basic parameters that characterize a code. 


Definition 13.2.2. The Hamming distance between two code words c; and cj is the 
number of components at which the two code words differ, i.e., the number of components 
where one code word is 1 and the other one is 0. The Hamming distance between two code 
words is denoted by d(c;, ¢;).” | 


Definition 13.2.3. The Hamming weight, or simply the weight of a code word c;, 
is the number of 1’s in the code word and is denoted by w(c;). | 


Definition 13.2.4. The minimum distance of a code is the minimum Hamming 
distance between any two different code words, i.e., 


dmin = min d (c;, Cj). (13.2.1) 
Che; 
ifj 


?Hamming distance is denoted by d and Euclidean distance is denoted by d®. 
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Definition 13.2.5. The minimum weight of a code is the minimum of the weights 
of the code words except the all-zero code word: 


Wmin = min w(c;). (13.2.2) 
cj;#0 


Theorem 13.2.1. In any linear code, dmin = Wmin- 

Proof. If c is a code word, then w(c) = d(c, 0). Also, if c; and cj are code words, 
so is c = c; ® cj and, moreover, d(c;,c;) = w(c). This implies that in a linear code 
corresponding to any weight of a code word, there exists a Hamming distance between two 
code words, and corresponding to any Hamming distance, there exists a weight of a code 
word. In particular, it shows that dmin = Wmin- 


Generator and Parity-Check Matrices. In an (n, k) linear block code, let the 
code word corresponding to the information sequences? e; = (1000... 0), e2 = (0100. . . 0), 
e3 = (0010...0),..., ex = (0000... 1) be denoted by g), 82, 23,..., Zk, respectively, 
where each of the g; sequences is a binary sequence of length n. Now, any information 
sequence x = (x1, X2, %3,..., Xk) can be written as 


n 
x= >> xe; (13.2.3) 


therefore, the corresponding code word will be 


n 
c=} x:g;. (13.2.4) 


If we define the generator matrix for this code as 


§1 811 8i2 --- 8in 
£ §21 822 +--+ 82 

Ga ee ee es (13.2.5) 
8k Ekl 8k2 +--+ Bkn 


then, from Equation (13.2.3), we can write 
c=xG, (13.2.6) 


where x is a 1 x k row vector and G is the k xn generator matrix. This shows that any linear 
combination of the rows of the generator matrix is a code word. The generator matrix for 
any linear block code is a k x n matrix of rank k (because the dimension of the subspace is 
k by definition). The generator matrix of a code completely describes the code. When the 
generator matrix is given, the structure of an encoder is quite simple. 


3In this chapter, all vectors are represented as row vectors. 
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Example 13.2.2 
Determine the generator matrix for the first code given in Example 13.2.1. 


Solution We have to find the code words corresponding to the information sequences (10) 
and (01). These are (10100) and (01111), respectively. Therefore, 


10100 
G= Remar (13.2.7) 


For the information sequence (x1, x2), the code word is given by 


(€1, C2, €3, Ca, C5) = (X1, X2)G (13.2.8) 
or 
cy = x; 
C2 = X2 


c3 = x BX2 
C4 = X2 


C5 = X2. a 


The preceding code has the property that the code word corresponding to each information 
sequence starts with a replica of the information sequence itself followed by some extra 
bits. Such a code is called a systematic code and the extra bits following the information 
sequence in the code word are called the parity-check bits. A necessary and sufficient 
condition for a code to be systematic is that the generator matrix be in the form 


G=|k | P|. (13.2.9) 


where I, denotes a k xk identity matrix and P is ak x (n—k) binary matrix. In a systematic 
code, we have 
; o 1<i<k 
g z= 


13.2.10 
Die Paix; k+l s<i<n ( ) 


where all summations are modulo 2. 

By definition, a linear block code © is a k-dimensional linear subspace of the 
n-dimensional space. From linear algebra, we know that if we take all sequences of 
length n that are orthogonal to all vectors of this k-dimensional linear subspace, the result 
will be an (n —k)-dimensional linear subspace called the orthogonal complement of the 
k-dimensional subspace. This (n — k)-dimensional subspace naturally defines an (n, n — k) 
linear code, which is known as the dual of the original (n, k) code @. The dual code is 
denoted by gt, Obviously, the code words of the original code @ and the dual code @+ are 
orthogonal to each other. In particular, if we denote the generator matrix of the dual code 
by H, which is an (n — k) x n matrix, then any code word of the original code is orthogonal 
to all rows of H, i.e., 


cH’ =0 for alle € €. (13.2.11) 
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The matrix H, which is the generator matrix of the dual code gt, is called the parity-check 
matrix of the original code €. Since all rows of the generator matrix are code words, we 
conclude that 


GH' =0. (13.2.12) 
In the special case of a systematic code, where 
G= [ te | P|; (13.2.13) 
the parity-check matrix has the following form: 
H= [ Pp! | Ik | l (13.2.14) 


Example 13.2.3 
Find the parity check matrix for the code given in Example 13.2.1. 


Solution Here, 


We conclude that 


therefore, 





The parity check equations, given by cH' = 0, are 


C182 Oc; =0 
Co Bc, =0 
C2 cs = 0. | 


Hamming Codes. Hamming codes are a class of linear block codes with 
n = 2” —1 andk = 2” — m — 1 for some m > 3, and, regardless of the value of 
m, have a minimum distance of dmin = 3. This means that for m = 3, we have a (7, 4) 
Hamming code, and for m = 4, we have a (15, 11) Hamming code. As we will see later, 
with this minimum distance, these codes are capable of providing error correction capabil- 
ities for single errors. The parity-check matrix for these codes has a very simple structure. 
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It consists of all binary sequences of length m except the all-zero sequence. The rate of 
these codes is given by 


2” —m—1 


R: = qm _ | > 


(13.2.15) 


which is close to 1 for large values of m. Therefore, Hamming codes are high-rate codes 
with a relatively small minimum distance (dmin = 3). We will see later that the minimum 
distance of a code is closely related to its error correcting capabilities. Therefore, Hamming 
codes have limited error-correcting capability. 


Example 13.2.4 


Find the parity-check matrix and the generator matrix of a (7, 4) Hamming code in the sys- 
tematic form. 


Solution In this case, m = 3; therefore, H consists of all binary sequences of length 3 
except the all-zero sequence. We generate the parity-check matrix in the systematic form as 


1011/100 
H=/1101/010/=[P'|%], 
0111/001 


and the generator matrix is 


If the information sequence x = (x1, x2, x3, x4) is encoded by this code, the resulting code 
word will be c = xG, whose components are given by 


cq =X) 
C2 = X2 
C3 = X3 
C4 = X4 


c5 =X, @ x3 ® x4 
C6 = X1 BX. BX 
C7 = X2 ® X3 ® x4. 


The parity-check equations are obtained from cH’ = 0 and are given by 


Ci @ c3 @c4@cs =0 
C1 BC2 @ c4 O c6 = 0 
c2 ® c3 O c4 Qc = Oz a 
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13.2.1 Decoding and Performance of Linear Block Codes 


The purpose of using coding in communication systems is to increase the Euclidean dis- 
tance between the transmitted signals and, hence, to reduce the error probability at a given 
transmitted power. This was shown by an example in the previous section. Referring to 
Figure 13.2, we see that this goal is achieved by choosing the code words to be as far apart 
on the vertices of the cube as possible. This means that a good measure for comparing the 
performance of two codes is the Hamming distance between code words. Keeping track 
of all distances between any two code words is difficult, and in many cases, impossible. 
Therefore, the comparison between various codes is usually done based on the minimum 
distance of the code, which, for linear codes, is equal to the minimum weight. It follows 
that for a given n and k, a code with a larger dmin (Or Wmin) performs better than a code 
with a smaller minimum distance. 


Soft-Decision Decoding. In Chapters 8 and 9, we have seen that the optimum 
signal-detection scheme on an additive white Gaussian noise channel is detection based on 
minimizing the Euclidean distance between the received signal and the transmitted signal. 
This means that after receiving the output of the channel and passing it through the matched 
filters, we choose one of the message signals that is closest to the received signal in the 
Euclidean distance sense. In using coded waveforms, the situation is the same. Assum- 
ing we are employing binary PSK for transmission of the coded message, a code word 
Ci = (Ci, Ci2,..., Cin) is mapped into the sequence s;(t) = Soi Wilt — (k — 1)T), 
where 


W(t), crK=1 
i = 13.2.16 
win) kA Cik =0 ( ) 


and y(t) is a signal of duration T and energy ©, which is equal to zero outside the interval 
(0, T]. Now the Euclidean distance between two arbitrary signal waveforms is given by 
Equation (13.1.18): 


(EY = adhe. (13.2.17) 


This gives a simple relation between the Euclidean and the Hamming distance when a 
binary PSK signaling scheme (or any antipodal signaling scheme) is employed. Now, using 


the general relation 
dE 
P =Q (Fe) ; (13.2.18) 
we obtain 


2dij@ 
No 





p(code word cj received|code word c; sent) = Q (13.2.19) 
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Since dij > dmin, and since Q(x) is a decreasing function of x, we conclude that 








p(code word c; received|code word c; sent) < Q N, (13.2.20) 
0 
Now using the union bound (see Section 8.4.2), we obtain 
2dmin 
p(error|code word c; sent) < (M — 1) Q = (13.2.21) 
0 
and assuming equiprobable messages, we finally conclude that 
2dmin6 
Py < (M -1)Q a (13.2.22) 
0 


There exists a simple relationship between the energy per code element, denoted as ©, 
and the energy per information bit, denoted as +. Since a code word has n elements, the 
total transmitted energy is n@. But a code word with n elements carries k information bits. 
Hence, kép = n&, and it follows that 


k 
€= 7 8b = Rp. (13.2.23) 
From Equations (13.2.22) and (13.2.23), we conclude that 


2dmin Re 
Py <(M—1)0 A ! (13.2.24) 


Equations (13.2.22) and (13.2.24) are bounds on the code word error probability 
of a coded communication system when optimal demodulation is employed. By optimal 
demodulation, we mean passing the received signal r(t) through a bank of matched fil- 
ters to obtain the received vector y, and then finding the closest point in the constella- 
tion to y in the Euclidean distance sense. This type of decoding that involves finding the 
minimum Euclidean distance is called soft-decision decoding, and requires real number 
computation. 


Example 13.2.5 
Compare the performance of an uncoded data transmission system with the performance of 
a coded system using the (7, 4) Hamming code given in Example 13.2.4 when applied to the 
transmission of a binary source with the rate R = 104 bits/sec. The channel is assumed to 
be an additive white Gaussian noise channel, the received power is 1 microwatt and the noise 
power spectral density is % = 107!!. The modulation scheme for the elements of any code 
word is binary PSK. 
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Solution 


1. If no coding is employed, we have 


2%, 2P 
P= — j= sy j. 2: 
2=O No o( z) (13.2.25) 
But A = won = 10; therefore, 
P2 = Q(V10) = Q(3.16) ~ 7.86 x 107. (13.2.26) 
The error probability for four bits will be 
Perrot in 4 bits = 1 — (1 — pp)* © 3.1 x 107°. (13.2.27) 


2. If coding is employed, we have dmin = 3 and 


E 6 P 
Pan ER ono eee, 
No No RN) 7 7 


Therefore, the message error probability is given by 


2dmin © 
No 


ov 3) 


= 150(4.14) © 2.6 x 107+. 


Py <(M-1)Q 


We see that using this simple code decreases the error probability by a factor of 12. Of course, 
the price that has been paid is an increase in the bandwidth required for the transmission of 
the messages. This bandwidth expansion ratio is given by 
Weode 
ad peas ONIS, a 
Wancoded Re 4 


Hard-Decision Decoding. A simpler and more frequently used decoding scheme 
is to make hard binary decisions on the components of the received vector y, and then to 
find the code word that is closest to it in the Hamming distance sense. The next example 
clarifies the distinction between soft and hard decisions. 


Example 13.2.6 
A (3,1) code consists of the two code words 000 and 111. The code words are transmitted using 
binary PSK modulation with 6 = 1. The received vector (the sampled outputs of the matched 
filters) is y = (.5, .5, —3). If soft decision is employed, we have to compare the Euclidean 
distance between y and the two constellation points (1, 1, 1) and (—1, —1, — 1) and choose the 
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smaller one. We have (d?(y, (1, 1, 1))* = .5?+.57+4? = 16.5 and (d®(y, (—1, —1, —1))? = 
1.5741.524 (—2)? = 8.5; therefore, a soft-decision decoder would decode y as (—1, —1, —1) 
or equivalently (0, 0, 0). However, if hard-decision decoding is employed, y is first compo- 
nentwise detected as 1 or 0. This requires a comparison of the components of y with the 
zero threshold. The resulting vector ¢ is therefore ê = (1, 1, 0). Now, we have to compare 
ĉ with the (1, 1, 1) and (0, 0, 0) and find the closer one in the Hamming distance sense. The 
result is, of course, (1, 1, 1). As seen in this example, the results of soft-decision decoding and 
hard-decision decoding can be quite different. Of course, soft-decision decoding is the optimal 
detection method and achieves a lower probability of error. a 


There are three basic steps involved in hard-decision decoding. First, we perform 
demodulation by passing the received r (t) through the matched filters and sampling the 
output to obtain the y vector. This is an n-dimensional vector whose components are real 
numbers. Second, we compare the components of y with the threshold (usually zero) and 
quantize each component to one of the two levels (usually 0 and 1) to obtain the estimate 
ĉ of the transmitted code word, which is an n-dimensional vector with binary components. 
Finally, we perform decoding by finding the code word that is closest to ¢ in the Ham- 
ming distance sense. In this section, we present a systematic approach to hard-decision 


decoding. 
First, we will define the notion of a standard array. Let the code words of the code 
- in question be denoted by cy, ¢2, ..., Cm, where each of the code words is of length n and 


M = 2*, and let c denote the all-zero code word. A standard array is a 2”-* x 2% array 
whose elements are binary sequences of length n and is generated by writing all the code 
words in a row starting with the all-zero code word. This constitutes the first row of the 
standard array. To write the second row, we look among all binary sequences of length n 
that are not in the first row of the array (i.e., are not code words). Choose one of these code 
words that has the minimum weight and call it e1. Write it under’ c} and write e; @c; under 
ci for2 <i < M. Thethirdrow of the array is completed in a similar way. From the binary 
n-tuples that have not been used in the first two rows, we choose one with minimum weight 
and call it e2. Then, the elements of the third row become c; @ e2. This process is continued 
until no binary n-tuples remain to start a new row. Figure 13.4 shows the standard array 
generated as explained. Each row of the standard array is called a coset and the first element 
of each coset (e;, in general) is called the coset leader. 
The standard array has several important properties. 


c O c3 cm 
e ei @ 1%) e ($>) C3 ei (> cM 
e e @ C e (> C3 (3) ($>) cM 
rs : ; eee : 
(n-k) (n-k (n-k) (n-k) $ 
eww -1 es" 0_1 © ey e)_, Beg en) Bey Figure 13.4 The standard array. 


*Note that c; ® e, = e, since c, = (0, 0, ..., 0). 
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Theorem 13.2.2. All elements of the standard array are distinct. 


Proof. Assume two elements of the standard array are equal. This can happen in two 
ways. 


1. The two equal elements belong to the same coset. In this case, we have e; ® c; = 
e, ® cj, from which we conclude c; = cj, which is impossible. 


2. The two equal elements belong to two different cosets. Here, we have ee ® c; = 
ex ® cj forl Æ k, which means e; = ex @ (c; ®c;). By linearity of the code, c; Oc; 
is also a code word; let us call it €m. Therefore, e; = ex ® Cm; hence, e, and ex belong 
to the same coset, which is impossible since, by assumption, k # I. 


a 
From this theorem, we conclude that the standard array contains exactly 2”—* rows. 


Theorem 13.2.3. If z, and zz are elements of the same coset, we have zH" = 22H’. 


Proof. It is enough to note that since zı and z; are in the same coset, z1 = €; Pc; and 
22 =e Ge; for some 1 < i, j < M. Therefore, 


zıH' = (e @ ci) H' =e,H' +0= (e ® cj) H' = 2H’. 
a 


From this theorem, we conclude that each coset of the standard array can be uniquely 
identified by the product e, H', where e; denotes the coset leader. In general, for any binary 
sequence z of length n, we define the syndrome s as 


s=zH'. (13.2.28) 


If z = e, @cj, i.e., z belongs to the (J-+1)st coset, then, obviously, s = e; H'. The syndrome 
is a binary sequence of length 2”—*, and corresponding to each coset, there exists a unique 
syndrome. The syndrome corresponding to the first coset, which consists of the code words, 
iss = 0. 


Example 13.2.7 


Construct the standard array for the (5, 2) code with the code words 00000, 10100, 01111, 
11011. Also, determine the syndromes corresponding to each coset. 


Solution The generator matrix of the code is 


and the parity-check matrix corresponding to G is 


1 110 0 
H=|0 1041 0 
01001 
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Using this construction, we obtain the standard array 


00000 01111 10100 11011 syndrome = 000 
10000 11111 00100 01011 syndrome = 100 
01000 00111 11100 10011 syndrome = 111 
00010 01101 10110 11001 syndrome = 010 
00001 01110 10101 11010 syndrome = 001 
11000 10111 01100 00011 syndrome = 011 
10010 11101 00110 01001 syndrome = 110 
10001 11110 00101 01010 syndrome = 101. a 


Assuming that the received vector y has been componentwise compared with a 
threshold and the resulting binary vector is ¢, we must find the code word which is at 
minimum Hamming distance from ĉ. First, we find in which coset ¢ is located. To do this, 
we find the syndrome of ¢ by calculating s = ¢H’. After finding s, we refer to the standard 
array and find the coset corresponding to s. Assume that the coset leader corresponding to 
this coset is e,. Because ¢ belongs to this coset, it is of the form e; ®c; for some 1 <i < M. 
The Hamming distance of ¢ from any code word c j is, therefore, 


d(È, cj) = wê 9 cj) = wle Pe; O cj). (13.2.29) 
Because the code is linear, c; ® cj = c, for some 1 < k < M. This means that, 

d(È, cj) = w(c, ® e1), (13.2.30) 
but c @ e; belongs to the same coset that ¢ belongs to. Therefore, to minimize d (ĉ, c j), we 
have to find the minimum weight element in the coset to which ¢ belongs. By construction 
of the standard array, this element is the coset leader, i.e., we choose cg = 0; therefore, 
cj = ¢;. This means that ¢ is decoded into c; by finding 


c= Oe. (13.2.31) 


Therefore, the procedure for hard-decision decoding can be summarized as follows: 


j 


. Find y, the vector representation of the received signal. 


2. Compare each component of y to the optimal threshold (usually 0) and make a binary 
decision on it to obtain the binary vector ¢. 


3. Find s = ĉH', the syndrome of é. 
4. Find the coset corresponding to s by using the standard array. 
5. Find the coset leader e and decode ¢ asc = ¢ @e. 


In this decoding scheme, the difference between the vector ¢ and the decoded vector c is 
e, so the binary n-tuple e is frequently referred to as the error pattern. This means that the 
coset leaders constitute the set of all correctable error patterns. 
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To obtain error bounds in hard-decision decoding, we note that, since a decision is 
made on each individual bit, the error probability for each bit for antipodal signaling is 


R= 01 J— 1, (13.2.32) 


P =Q $ : (13.2.33) 


The channel between the input code word c and the output of the hard limiter ¢ is a 
binary-input binary-output channel that can be modeled by a binary symmetric channel 
with crossover probability P}. Because the code is linear, the distance between any two 
code words c; and c; is equal to the distance between the all-zero code word, 0, and the 
code word c; ® cj = cz. Thus, without loss of generality, we can assume that the all-zero 
code word is transmitted. If 0 is transmitted, the error probability, by the union bound, 
cannot exceed (M — 1) times the probability of decoding the code word that is closest to 0 
in the Hamming distance sense. For this code word, denoted by c, which is at distance dyin 
from 0, we have 


and for orthogonal signaling is 


Sgt (i) PECL — Py) din, dmin odd 


min dmin dmin—i 
P(c |0 sent) < X, g i )Pi(1 — P2) 


d 


+ 4( ae )P P RER. a dmin even 


“oy 
or, in general, 
d, à 
min d 
P(c |0 sent) < 5D ( I (1 — Pa), (13.2.34) 
j=[ mitt] i 
2 
Therefore, 
d 
min d N 7 X 
Py < (M — 1) D ( =) pac — P)4min— (13.2.35) 
cae | 


This gives an upper bound on the error probability of a linear block code using hard- 
decision decoding. A simpler error bound for hard-decision decoding is given by the 
inequality 


d ri 
Pu < (M — 1) [P1 — P)? . (13.2.36) 
Equation (13.2.36) is derived in Problem 13.21. 
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As seen in both soft-decision and hard-decision decoding, dmin plays a major role in 
bounding the error probability. This means that, for a given (n, k), it is desirable to have 
codes with large dmin. 

We can show that the difference between the performance of soft- and hard-decision 
decoding is roughly 2 dB for an additive white Gaussian noise channel. That is, the error 
probability of a soft-decision decoding scheme is comparable to the error probability of a 
hard-decision scheme whose power is 2 dB higher than the soft-decision scheme. We can 
also show that if, instead of quantization of each component of y to two levels, an eight- 
level quantizer (three bits/component) is employed, the performance difference with soft 
decision (infinite precision) reduces to 0.1 dB. This multilevel quantization scheme, which 
is a compromise between soft (infinite precision) and hard decision, is also referred to as 
soft-decision decoding in the literature. 


Example 13.2.8 
If hard-decision decoding is employed in Example 13.2.5, how will the results change? 


Solution In this example, P} = o,/8 ) = Q(2.39) = 0.0084 and din = 3. Therefore, 


7 7 
Pig < (3) eta — PP + VES — P) ++ Py 
~ 21 P x 1.5 x 107. 


Thus, coding has decreased the error probability by a factor of 2 (it was 12 in the soft-decision 
case). If, instead of the exact error probability, we used the bound in Equation (13.2.36), we 
would find that 


Pis < (2* — 1) [P,(1 — P)? = 0.0114 = 11.4 x 107°. (13.2.37) 


n 
13.2.2 Some Important Linear Block Codes 


In general, hard-decision decoding of linear block codes using the standard array and syn- 
drome decoding is too complex to be used in practice for long block lengths. In order 
to make decoding of linear block codes practical, special classes of linear block codes 
for which low complexity decoding algorithms exist have been designed and decoding 
algorithms for these special classes have been devised. 

One of the most widely used subclasses of linear block codes is cyclic codes. Cyclic 
codes are linear block codes with the additional property that a cyclic shift of any code 
word is itself a code word. It turns out that cyclic codes have a rich and nice structure if 
expressed in terms of polynomials. Hard-decision decoding of cyclic codes is simpler than 
hard-decision decoding for the general class of linear block codes and can be implemented 
using simple shift-register circuits. 

A subclass of cyclic codes called BCH codes (Bose, Chaudhuri, and Hocquenghem) 
are particularly interesting and have been used extensively. BCH codes can be designed 
for correction of any given number of errors. In many books on coding theory, there are 
extensive tables for the design of BCH codes that can correct a certain number of errors. 
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There exists an elegant and fast decoding algorithm, called the Berlekamp—Massey decod- 
ing algorithm, for hard-decision decoding of the BCH codes. 

Reed-Solomon codes are a subset of nonbinary BCH codes; therefore, they belong to 
the family of cyclic codes. Unlike other codes that we have studied in this chapter, Reed- 
Solomon codes are nonbinary codes, i.e., in a code word ¢ = (c1, C2, ..., Cn), the elements 
ci are not binary 0 or 1 but each c; is itself a sequence of length k of 0’s and 1’s. Therefore, 
the size of the alphabet in Reed-Solomon codes is q = 2", and the code is a q-ary code. 
Reed-Solomon codes are particularly useful in correction of bursts of errors as studied in 
Section 13.2.4. Fading channels and storage channels are examples of channels in which 
errors tend to occur in bursts. Reed-Solomon codes are widely used in data, music, and 
video storage on CDs and DVDs. 

Reed-Solomon codes can also be concatenated with a binary code to provide higher 
levels of error protection. The binary code used in concatenation with the Reed-Solomon 
codes could be either a block code or a convolutional code. The binary encoder and decoder 
are located right before the modulator and after the demodulator, respectively, and are 
called the inner encoder—decoder pair. We will discuss concatenated codes in Section 13.4. 

The detailed structure of cyclic, BCH, and Reed-Solomon codes requires consider- 
able knowledge of finite-field theory and is treated in many standard textbooks on coding 
theory. The interested reader can refer to the books cited at the end of this chapter. 


13.2.3 Error Detection versus Error Correction 


Let € be a linear block code with minimum distance dmin. Then if c is transmitted and hard- 
decision decoding is employed, any code word will be decoded correctly if the received ¢ 
is closer to c than any other code word. This situation is shown in Figure 13.5. As shown, 
around each code word there is a “Hamming sphere” of radius ec, where ec denotes the 
number of correctable errors. As long as these spheres are disjoint, the code is capa- 
ble of correcting e, errors. A little thinking shows that the condition for nonoverlapping 
spheres is 


a +1=dmin dmin odd 


2ec +2 =dmin dmin even 











x Ain 
Oe S B S E c2 dminodd 
a ——— > 
ec ec 
p dmin 3 
Cy 2-4 C2 Amine ven 
ele te o Figure 13.5 Relation between 
ec Ec 


€c and dmin- 
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a min =, 
og E <— A 
e : Figure 13.6 Relation between e,, 
Ea ed and min + 
or 
dmin — 1 
a dmin odd 
ec = ; (13.2.38) 
Amin — 2 dmin even 
2 min 
which can be summarized as 
dmin — 1 
ec = [=| . (13.2.39) 


In some cases, we are interested in decoding procedures that can detect errors rather 
than correct them. For example, in a communication system where a feedback link is avail- 
able from the receiver to the transmitter, it might be desirable to detect whether an error 
has occurred and if so, to ask the transmitter via the feedback channel to retransmit the 
message. If we denote the error-detection capability of a code by eg, then obviously, in 
the absence of error correction, e4 = dmin — 1 because if dimin — 1 or fewer errors occur, 
the transmitted code word will be converted to a noncode word sequence and an error is 
detected. If both error correction and error detection are desirable, then there is naturally a 
trade-off between these two conditions. Figure 13.6 demonstrates this. From this picture, 
we see that 

ec + ea < dmn- 1, (13.2.40) 


with the extra condition ec < ea. 


13.2.4 Burst-Error-Correcting Codes 


Most of the linear block codes are designed for correcting random errors, i.e., errors that 
occur independently from the location of other channel errors. Certain channel models, 
including the additive white Gaussian noise channel, can be well modeled as channels 
with random errors. In some other physical channels, however, the assumption of indepen- 
dently generated errors is not a valid assumption. One such example is a fading channel, as 
discussed in Section 14.1. In such a channel, if the channel is in deep fade, a large number 
of errors occur in sequence, i.e., the errors have a bursty nature. Obviously, in this channel, 
the probability of error at a certain location or time depends on whether or not its adjacent 
bits are received correctly. Another example of a channel with bursty errors is a compact 
disc. Any physical damage to a compact disc, such as a scratch, damages a sequence of 


710 * Coding for Reliable Communications Chapter 13 


Figure 13.7 Block diagram of a system that employs interleaving/deinterleaving for burst-error correction. 


Read out bits to modulator 
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Read in coded bits 
from encoder 





Figure 13.8 A block interleaver for coded data. 


bits; therefore, the errors tend to occur in bursts. Of course, any random error-correcting 
code can be used to correct bursts of errors as long as the number of errors is less than 
half of the minimum distance of the code. But the knowledge of the bursty nature of errors 
makes it possible to design more efficient coding schemes. Two particular codes that are 
designed to be used for burst-error correction are Fire codes and Burton codes. Refer to 
Lin and Costello (2005) for a discussion of these codes. 

An effective method for the correction of error bursts is to interleave the coded data 
such that the location of errors looks random and is distributed over many code words rather 
than a few code words. In this way, the number of errors that occur in each block is low and 
can be corrected by using a random error-correcting code. At the receiver, a deinterleaver 
is employed to undo the effect of the interleaver. A block diagram of a coding system 
employing interleaving/deinterleaving is shown in Figure 13.7. 

An interleaver of depth m reads m code words of length n each and arranges them 
in a block with m rows and n columns. Then, this block is read by column and the output 
is sent to the digital modulator. At the receiver, the output of the detector is supplied to the 
deinterleaver, which generates the same m x n block structure. It then reads by row and 
sends the output to the channel decoder. This is shown in Figure 13.8. 

Assume that m = 8 and the code in use is a (15, 11) Hamming code capable of cor- 
recting one error/code word. Then, the block generated by the interleaver is an 8 x 15 
block containing 120 binary symbols. Obviously, any burst of errors of length 8 or less 
will result in a maximum of 1 error/code word; therefore, it can be corrected. If inter- 
leaving/deinterleaving was not employed, an error burst of length 8 could possibly result 


in erroneous detection in 2 code words (or error in detection of up to 22 information 
bits). 
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13.3 CONVOLUTIONAL CODES 


Convolutional codes are different from block codes because of the existence of memory 
in the encoding scheme. In block codes, each block of k input bits is mapped into a block 
of length n of output bits by a rule defined by the code (e.g., by G) and regardless of the 
previous inputs to the encoder. The rate of such a code is given by 


pet. (13.3.1) 


In convolutional codes, each block of k bits is again mapped into a block of n bits to 
be transmitted over the channel, but these n bits are not only determined by the present k 
information bits, but also by the previous information bits. This dependence on the previous 
information bits causes the encoder to be a finite state machine. 

To be more specific, the block diagram of a convolutional encoder is given in 
Figure 13.9. The convolutional encoder consists of a shift register with kL stages where 
L is called the constraint length of the code. At each instant of time, k-information bits 
enter the shift register and the contents of the last k stages of the shift register are dropped. 
After the k bits have entered the shift register, n-linear combinations of the contents of 
the shift register, as shown in the figure, are computed and used to generate the encoded 
waveform. From this coding procedure, it is obvious that the n-encoder outputs not only 
depend on the most recent k bits that have entered the encoder, but also on the (L — 1)k 
contents of the first (L — 1)k stages of the shift register before the k bits arrived. Therefore, 
the shift register is a finite state machine with 2“—! states. For each k-input bits, we have 
n-output bits, so the rate of this code is simply 

k 
Re = = (13.3.2) 








H- Lk stages >| 
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Information 
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Encoded 
sequence 
to modulator 


Figure 13.9 The block diagram of a convolutional encoder. 
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Example 13.3.1 


A convolutional encoder is shown in Figure 13.10. In this encoder, k = 1, n = 2, and L = 3, 


Therefore, the rate of the code is 5 and the number of states is 24-* = 4. One way to 


describe such a code (other than drawing the encoder) is to specify how the two output bits 
of the encoder depend on the contents of the shift register. This is usually done by specifying 
n vectors 21, 82,- -, Zn, known as generator sequences of the convolutional code. The ith, 
1 <i <2*+, component of g jə 1<j <n, is 1 if the ith stage of the shift register is connected 
to the combiner corresponding to the jth bit in the output and 0 otherwise. In this example, 
the generator sequences are given by 


g,=[1 0 1), 
gan ri 3 


13.3.1 Basic Properties of Convolutional Codes 


Because a convolutional encoder has finite memory, it can easily be represented bý a state- 
transition diagram. In the state-transition diagram, each state of the convolutional encoder 
is represented by a box, and transitions between states are denoted by lines connecting 
these boxes. On each line, both the input(s) causing that transition and the corresponding 
output(s) are specified. The number of lines emerging from each state is, therefore, equal to 
the number of possible inputs to the encoder at that state, which is equal to 2. The number 
of lines merging at each state is equal to the number of states from which a transition is 
possible to this state. This is equal to the number of possible combinations of bits that leave 
the encoder as the k bits enter the encoder. This, again, is equal to 2%. Figure 13.11 shows 
the state transition diagram for the convolutional code of Figure 13.10. 

A second, and more popular method, to describe convolutional codes is to specify 
their trellis diagram. The trellis diagram is a way to show the transition between various 
states as the time evolves. The trellis diagram is obtained by specifying all states on a 
vertical axis and repeating this vertical axis along the time axis. Then, each transition from 
a state to another state is denoted by a line connecting the two states on two adjacent 
vertical axes. In a sense, the trellis diagram is nothing but a repetition of the state-transition 
diagram along the time axes. As was the case with the state-transition diagram, we again 
have 2" branches of the trellis leaving each state and 2* branches merging at each state. In 
the case where k = 1, it is common to denote the branch corresponding to a 0 input by a 


Figure 13.10 A rate £ convolutional 
encoder. 
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Figure 13.11 State transition diagram for the 
encoder of Figure 13.10. Encoderinput and 
outputs are shown on transition lines separated 
by “P. 





a 00 





b 01 oè 


States ae 


c 10 oè 


d 11 © n 
Figure 13.12 Trellis diagram for the 


encoder of Figure 13.10. 


bold line and the branch corresponding to a 1 input to the encoder by a dashed line. Figure 
13.12 shows the trellis diagram for the code described by the encoder of Figure 13.10. 


Encoding. The encoding procedure in a convolutional code is very simple. We 
assume that the encoder, before the first information bit enters it, is loaded with zeros (the 
all-zero state). The information bits enter the encoder k bits at a time and the corresponding 
n-output bits are transmitted over the channel. This procedure continues until the last group 
of k bits is loaded into the encoder and the corresponding n-output bits are sent over the 
channel. We will assume, for simplicity, that after the last set of k bits, another set of 
k(L — 1) zeros enters the encoder and the corresponding n outputs are transmitted over 
the channel. This returns the encoder to the all-zero state and makes it ready for the next 
transmission. 


Example 13.3.2 


In the convolutional code shown in Figure 13.10, what is the encoded sequence corresponding 
to the information sequence x = (1101011)? 
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Solution It is enough to note that the encoder is in state 0 before transmission, and after 
transmission of the last information bit two 0 bits are transmitted. This means that the trans- 
mitted sequence is x; = (110101100). Using this transmission sequence, we have the code 
word ¢ = (111010000100101011). a 


The Transfer Function. For every convolutional code, the transfer function gives 
information about the various paths through the trellis that start from the all-zero state 
and return to this state for the first time. According to the coding convention previously 
described, any code word of a convolutional encoder corresponds to a path through the 
trellis that starts from the all-zero state and returns to the all-zero state. As we will see in 
Section 13.3.4, the transfer function of a convolutional code plays a major role in bounding 
the error probability of the code. To obtain the transfer function of a convolutional code, we 
split the all-zero state into two states, one denoting the starting state and one denoting the 
first return to the all-zero state. All the other states are denoted as in-between states. Corre- 
sponding to each branch connecting two states, a function of the form D” NP J is defined 
where a denotes the number of ones in the output bit sequence for that branch and £ is the 
number of ones in the corresponding input sequence for that branch. The transfer function 
of the convolutional code is, then, the transfer function of the flow graph between the start- 
ing all-zero state and the final all-zero state, and will be a function of the tree parameters 
D, N, and J and denoted by T(D, N, J). Each element of T(D, N, J) corresponds to a 
path through the trellis starting from the all-zero state and ending at the all-zero state. The 
exponent of J indicates the number of branches spanned by that path, the exponent of D 
shows the number of ones in the code word corresponding to that path (or equivalently the 
Hamming weight of the code word), and finally, the exponent of N indicates the number 
of ones in the input information sequence. Since T(D, N, J) indicates the properties of 
all paths through the trellis starting from the all-zero path and returning to it for the first 
time, then, in deriving it, any self-loop at the all-zero state is ignored. To obtain the trans- 
fer function of the convolutional code, we can use all rules that can be used to obtain the 
transfer function of a flow graph. 


Example 13.3.3 
Find the transfer function of the convolutional code of Figure 13.10. 


Solution Figure 13.13 shows the diagram used to find the transfer function of this code. The 
code has a total of four states denoted by the contents of the first two stages of the shift register. 


Figure 13.13 Flow graph for finding the transfer 
function. 
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We denote these states by the following letters: 
00 >a 
0l > b 
10> c 


ll >d. 


As seen in the figure, state a is split into states a’ and a” denoting the starting and returning 
state. Using the flow graph relations, we can write 


Xe = Xu D’ NJ +NJX, 
Xp = DJ Xa + DJX, 

Xa = DNJX,+ DNJ Xa 
Xar = D’ J X,. 


Eliminating X+, Xc, and X4 results in 





Xar D5N J? 
T(D,N, J) = E z (13.3.3) 
Xu 1— DNJ — DNJ? 
Now, expanding T (D, N, J) ina polynomial form, we obtain 
T(D, N, J) = DDN J? + DÉN? Jt + DÉN? J5 + D'N? J? ++- (13.3.4) 


The term D°N J? indicates that there exists a path through the trellis starting from the all- 
zero state and retumming to the all-zero state for the first time, which spans three branches, 
corresponding to an input-information sequence containing one 1 (and, therefore, two 0’s), and 
the code word for this path has Hamming weight equal to 5. This path is indicated with bold 
lines in the Figure 13.14. This path is somewhat similar to the minimum-weight code word 
in block codes. In fact, this path corresponds to the code word that is at “minimum distance” 
from the all-zero code word. This minimum distance, which is equal to the minimum power 
of D in the expansion of T (D, N, J), is called the free distance of the code and denoted by 
dree. The free distance of this code is equal to 5. The general form of the transfer function is, 


therefore, % 


T(D, N, J) = Ý aD NFO ye), (13.3.5) 


d=dfree 


Figure 13.14 The path corresponding to D°N J? in 
code represented in Figure 13.10. 
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A shorter form of transfer function, which only provides information aboutthe weight 
of the code words, can be obtained from T(D, N, J) by setting N = J = 1. This shorter 
form will be denoted by 


foe} 


T(D)= J aD’, (13.3.6) 


d=dfree 


and will later be used in deriving bounds on the error probabilities of the convolutional 
codes. 








Example 13.3.4 
For the code of Figure 13.10, we have 
DÎNJ? 
T (D) = —— 
1 — DNJ — DNJ? |y- 
D5 
~ 1-2D 


= D’ +2D6 +4D' +--- 


22 a 6 
= 2. D5", = 


Catastrophic Convolutional Codes. A convolutional code maps a (usually long) 
sequence of input information bits into a code word to be transmitted over the channel. The 
purpose of coding is to provide higher levels of protection against channel noise. Obvi- 
ously, a code that maps information sequences that are far apart into code words that are 
not far apart is not a good code since these two code words can be mistaken rather easily 
and the result would be a large number of bit errors in the information stream. A limit- 
ing case of this undesirable property happens when two different information sequences in 
infinitely many positions are mapped into code words that differ only in a finite number 
of positions. In such a case, since the code words differ in a finite number of bits, there 
always exists the probability that they will be erroneously decoded. This in turn results in 
an infinite number of errors in detecting the input information sequence. Codes that exhibit 
this property are called catastrophic codes and should be avoided. 

As an example of a catastrophic code consider the (2, 1) code described by 


g,=[110] 


The encoder and the state-transition diagram for this code are given in Figure 13.15. As 
seen in this diagram, there is a self-loop in state “11” that corresponds to a “1” input 


to the encoder and the corresponding output consists of all zeros. Therefore, if an input 
information stream consists of all ones, the corresponding output will be 


c = (10010000. . . 0001001). 
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Figure 13.15 Encoder and the state-transition diagram for a catastrophic code. 


If we compare this code word with the code word corresponding to the all-zero information 
sequence 


co = (0000. . . 000), 


we observe that although the two information sequences are different in a large number 
of positions, the corresponding output sequences are quite close (the Hamming distance 
being only 4); therefore, they can be mistaken very easily. The existence of such a self- 
loop (corresponding to k inputs which are not all zeros and for which the n output bits are 
all zeros) shows that a code is catastrophic and should therefore be avoided. 


13.3.2 Maximum Likelihood Decoding of Convolutional Codes—The Viterbi 
Algorithm 


In our discussion of various decoding schemes for block codes, we saw that there exists 
the possibility of soft- and hard-decision decoding. In soft-decision decoding, y, the vector 
denoting the outputs of the matched filters, is compared with the various signal points in 
the constellation of the coded modulation system and the one closest to it in Euclidean 
distance is chosen. In hard-decision decoding, y is first tamed into a binary sequence ¢ by 
making decisions on individual components of y; then the code word, which is closest to 
ĉ in Hamming distance, is chosen. We see that in both approaches, a fundamental task is 
to find a path through the trellis that is at minimum distance from a given sequence. This 
fundamental problem arises in many areas of communications and other disciplines of elec- 
trical engineering. Particularly, the same problem is encountered in maximum-likelihood 
sequence estimation when transmitting over bandlimited channels with intersymbol inter- 
ference (Section 10.4.3), speech recognition, and some pattern classification schemes, etc. 
All these problems are essentially the same and can be titled as optimal trellis searching 
algorithms. The well-known Viterbi algorithm provides a satisfactory solution to all these 
problems. 
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In hard-decision decoding of convolutional codes, we choose a path through the trel- 
lis whose code word, denoted by c, is at minimum Hamming distance from the quantized 
received sequence ¢. In hard-decision decoding, the channel is binary memoryless (the fact 
that the channel is memoryless follows from the fact that the channel noise is assumed to 
be white). Because the desired path starts from the all-zero state and returns back to the 
all-zero state, we assume that this path spans a total of m branches, and since each branch 
corresponds to n bits of the encoder output, the total number of bits in c (and also in ĉ) is 
mn. We denote the sequence of bits corresponding to the ith branch by c; and ¢;, respec- 
tively, where 1 <i < m and each c; and ¢; is of length n. The Hamming distance between 
c and ¢ is therefore 


d(c,é) = ) d(c, ĉi). (13.3.7) 
i=1 


In soft-decision decoding, we have a similar situation, but with three differences: 


1. Instead of ¢, we are dealing directly with the vector y, which is the vector output of 
the optimal (matched filter type or correlator type) digital demodulator. 


2. Instead of the binary sequence c, we are dealing with the corresponding sequence c’ 


with 
VE, Cij = 1 
sare, Cij = 0 
3. Instead of Hamming distance, we are using Euclidean distance. This is a conse- 


quence of the fact that the channel under study is an additive white Gaussian noise 
channel. 


for! <i<mand1 <j <n. 


Thus, we have 


dz(c’, y) =) d? (ej, yi). (13.3.8) 
i=1 


From Equations (13.3.7) and (13.3.8), we see that the generic form of the problem we 
have to solve is as follows: Given a sequence a, find a path through the trellis, start at the 
all-zero state 5; = 0 and end at the all-zero state Sm = 0, such that some distance measure 
between a and a sequence b corresponding to that path is minimized.> The important fact 
that makes this problem easy to solve is that the distance between a and b in both cases of 
interest can be written as the sum of the distances corresponding to individual branches of 
the path. This is easily observed from Equations (13.3.7) and (13.3.8). 

Assume that the general problem is formulated as minimizing a metric p in the form 


(a, b) = (ai, bi), 
i=l 


>The problem can also be formulated as a maximization problem. For example, instead of minimizing the 
Euclidean distance, we could maximize the correlation. 
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where, for soft- and hard-decision decoding, 2 represents the Euclidean distance and the 
Hamming distance, respectively. First, we observe that if the path (S1 = 0, S; = 1, Sm = 
0), 1 <i < m, is the optimal path starting from the all-zero state, terminating at the all- 
zero state, and passing through state / at time i, then the metric contribution of part of this 
path (Sı = 0, S; = J) is lower than the metric contribution of any other path connecting 
Sı = 0 to S; = l and denoted by (Sı = 0, S; = 1). Otherwise, the path consisting of the 
concatenation of (Sı = 0, S; = J) and (S; = 1, Sm = 0) would be the optimal path. This is 
shown in Figure 13.16. 

This shows that at each state / at time i, only one path connecting the all-zero state to 
state l should be saved. This path is the path corresponding to the lowest metric connecting 
Sı = 0 to S; = l, i.e., the path at the minimum Hamming, or Euclidean, distance from the 
received sequence. The path of minimum metric connecting Sı = 0 to S; = l is called the 
survivor path, or simply the survivor, at S; = L. 

Let A;—; denote the set of states at time i — 1 that are connected with a branch 
to S; = L. If the path from Sı = 0 to S; = l is a survivor, then this path must be the 
concatenation of a survivor path at S;_; = A, for some A € A;_1, and the branch connecting 
Si—1 = à to S$; = l. Therefore, in order to find the survivor path at state S; = l, it is 
sufficient to have the survivors (and their metrics) for all S;_; =A, A € Aj_, then append 
them to the branches connecting the elements of A;_; to S$; = J, find the metric of the 
resulting paths from Sı = 0 to $; = l, and pick the one with the minimum metric. This 
will be the new survivor path at S; = l. This process is started at Sı = 0 and is finished at 
Sm = 0. The final survivor at Sm = 0 represents the optimal path and the best (maximum 
likelihood) match to the received sequence. This process is shown in Figure 13.17. 

Ateach step, the new survivor metric is 


(Sı = 0,5; = 1) = min (M(S1 = 0, Sin =A) +U(S = 2,51 =D}. 13.3.9) 


Having found the survivor metric, the new survivor at S; = l isthe path (S1 = 0, S;_; = À, 
Si = 1) for the À that minimizes the survivor metric in Equation (13.3.9). 
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Figure 13.16 Comparison of the optimal path (S, = 0, S; = L, S,, = 0) with a suboptimal path consisting of 
the concatenation of (S, = 0, S; = L) and (S; = l, Sm = 0). 
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Figure 13.17 Finding a new survivor 
from old survivors. 


This procedure can be summarized in the Viterbi algorithm: 


. Parse the received sequence into m subsequences, each of length n. 
. Draw a trellis of depth m for the code under study. For the last L — 1 stages of the 


trellis, draw only paths corresponding to the all-zero input sequences. [This is done 
because we know that the input sequence has been padded with k(L — 1) zeros.] 


. Seti = 1 andset the metric of the initial all-zero state equal to zero. 
. Find the distance from the ith subsequence of the received sequence to all branches 


connecting the ith stage states to the (i + 1)st stage states of the trellis. 


. Add these distances to the metrics of the ith stage states to obtain the metric candi- 


dates for the (i + 1)st stage states. For each state of the (i + 1)st stage, there are 2* 
candidate metrics, each corresponding to one branch ending at that state. 

For each state at the (i + 1)st stage, choose the minimum of the metric candidates; 
then, label the branch corresponding to this minimum value as the survivor and 
assign the minimum of the metric candidates as the metrics of the (i + 1)st stage 
states. 


. Ifi = m, goto Step 8. Otherwise, increase i by 1 and go to Step 4. 


Starting with the all-zero state at the final stage, go back through the trellis along the 
survivors to reach the initial all-zero state. This path is the optimal path and the input 
bit sequence corresponding to it is the maximum likelihood decoded information 
sequence. To obtain the most likely input bit sequence, remove the last k(L — 1) 
zeros from this sequence. 


As seen from this algorithm, the decoding delay and the amount of memory required 


for decoding a long information sequence is unacceptable. The decoding cannot be started 
until the whole sequence (which, in the case of convolutional codes, can be very long) is 
received, and all surviving paths have to be stored. In practice, a suboptimal solution that 
does not cause these problems is desirable. One such approach, called path memory trun- 
cation, is that the decoder at each stage only searches ô stages back in the trellis instead 
of searching back to the start of the trellis. With this approach, at the (6 + 1)th stage, the 
decoder makes a decision on the input bits corresponding to the first stage of the trellis 


Section 13.3 Convolutional Codes j 721 


(the first k bits) and future received bits do not change this decision. This means that the 
decoding delay will be kô bits, and it is required only to keep the surviving paths cor- 
responding to the last 6 stages. Computer simulations have shown that if 6 ~ SL, the 
degradation in performance due to path memory truncation is negligible. 


Example 13.3.5 
Assume that, in hard-decision decoding, the quantized received sequence is 


ĉ = (01101111010001). 


The convolutional code is given in Figure 13.10. Find the maximum likelihood information 
sequence and the number of errors. 


Solution The code is a (2, 1) code with L = 3. The length of the received sequence ĉ is 
14. This means that m = 7 and we have to draw a trellis of depth 7. Also note that because 
the input information sequence is padded with k(L — 1) = 2 zeros, for the final two stages of 
the trellis, we will only draw the branches corresponding to all-zero inputs. This also means 
that the actual length of the input sequence is 5, which, after padding with two zeros, has 
increased to 7. The trellis diagram for this case is shown in Figure 13.18. The parsed received 
sequence ĉ is also shown in this figure. In drawing the trellis in the last two stages, we have 
considered only the zero inputs to the encoder. In the final two stages, there are no dashed 
lines corresponding to 1 inputs. Now the metric of the initial all-zero state is set to zero and 
the metrics of the next stage are computed. In this step, there is only one branch entering each 
state; therefore, there is no comparison, and the metrics, which are the Hamming distances 
between that part of the received sequence and the branches of the trellis, are added to the 
metric of the previous state. In the next stage, there is no comparison either. In the third stage, 
we actually have two branches entering each state. This means that a comparison has to be 
made, and survivors are to be chosen. From the two branches that enter each state, the one that 
corresponds to the least total accumulated metric remains as a survivor and the other branches 
are deleted (marked with 4- on the graph). If, at any stage, two paths result in the same metric, 
each one of them can be a survivor. Such cases have been marked by a “?” in the trellis 
diagram. The procedure is continued to the final all-zero state of the trellis. Starting from 
that state, we move along the surviving paths to the initial all-zero state. This path, which 
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Figure 13.18 The trellis diagram for the Viterbi decoding of the sequence (01101111010001). 
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is denoted by a heavy path through the trellis, is the optimal path. The input bit sequence 
corresponding to this path is 1100000 (where the last two zeros are not information bits, but 
are added to return the encoder to the all-zero state). Therefore, the information sequence 
is 11000. The corresponding code word for the selected path is 11101011000000, which is 
at Hamming distance 4 from the received sequence. No other path through the trellis is at a 
Hamming distance less than 4 from the received ĉ. Py 


For soft-decision decoding, a similar procedure is followed with squared Euclidean 
distance substituted for Hamming distance. 


13.3.3 Other Decoding Algorithms for Convolutional Codes 


The Viterbi algorithm provides maximum-likelihood decoding for convolutional codes. 
However, as we have already seen, the complexity of the algorithm is proportional to the 
number of states in the trellis diagram. This means that the complexity of the algorithm 
increases exponentially with the constraint length of the convolutional code. Therefore, 
the Viterbi algorithm can be applied only to codes with low constraint lengths. For higher 
constraint-length codes, other suboptimal decoding schemes have been proposed. These 
include the sequential decoding of Wozencraft (1957), the Fano algorithm (1963), the 
stack algorithm [Zigangirov (1966) and Jelinek (1969)], the feedback-decoding algorithm 
[Heller (1975)], and majority logic decoding [Massey (1963)]. 


13.3.4 Bounds on the Error Probability of Convolutional Codes 


Study of error performance of convolutional codes is different from that of block codes 
since for these codes there exists no block structure; instead, a long sequence of information 
bits is mapped into a longer binary sequence and then transmitted. The number of bit errors 
in decoding of the information sequence is a random variable that depends on both the 
channel noise and the length of the transmitted sequence. The longer the input sequence, 
the higher the probability of making errors in that sequence. Therefore, it makes sense to 
normalize the number of bit errors by the length of the input sequence. 

A commonly used measure for comparing the performance of convolutional codes is 
the average bit error probability, defined as the expected number of erroneously decoded 
bits per transmitted bit. To find a bound on this quantity, we first derive a bound on the 
average number of bits in error for each information sequence of length k and then nor- 
malize the result by dividing it by k. We begin by assuming that the all-zero sequence is 
transmitted and up to stage l in the decoding process there has been no error. At this point 
k new information bits enter the encoder and in the decoding, we move to the next stage 
in the trellis. We are interested in finding a bound on the expected number of errors due to 
transmission of these k bits. Because we are assuming that the all-zero sequence is trans- 
mitted and there has been no error up to stage l, the all-zero path through the trellis must 
have the minimum metric up to the /th stage. Moving to the next stage, i.e., the (J + 1)st 
stage. It is possible that another path through the trellis will have a metric less than the 


Because of the linearity of convolutional codes we can, without loss of generality, make this assumption. 
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Figure 13.19 The path 
corresponding to the first-error 
e e e ° e e event. 


all-zero path and, therefore, cause errors. If this happens, we must have a path through the 
trellis that merges with the all-zero path, for the first time, at the (J + 1)st stage which has a 
metric less than the all-zero path. Such an event is called the first-error event and the cor- 
responding probability is called the jirst-error-event probability. This situation is depicted 
in Figure 13.19. 

Our first step would be bounding the first-error-event probability. Let P,(d) denote 
the probability that a path through the trellis, which is at Hamming distance d from the 
all-zero path, is the survivor at the (J + 1)st stage. Denoting the number of paths of weight 
d by ag, we can bound the first-error-event probability by 


Pe< J` aaPr(d), (13.3.10) 


d=dfree 


where, on the right-hand side, we have included all paths through the trellis that merge 
with the all-zero path at the (J + 1)st stage. The value of P,(d) depends on whether soft- 
or hard-decision decoding is employed. 

For soft-decision decoding, if antipodal signaling (binary PSK) is used, we have 


E 
P,(d) = o(e) 


J/2No 

2éd 

=Q | /— 

No 

Sp 
= 2R.d— |; 13.3.11 
Q No ( ) 
therefore, 
œœ E, 

P, < 2R.d— Į. 13.3.12 
<} ug as. ( ) 
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Using the upper bound on the Q function, we have 


i e- Red by /No 
No 2 : 
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we finally obtain 
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Chapter 13 


(13.3.13) 


(13.3.14) 


(13.3.15) 


This is a bound on the first-error-event probability. To find a bound on the aver- 
age number of bits in error for k-input bits, P,(k), we note that each path through the 
trellis causes a certain number of input bits to be decoded erroneously. For a general 
D4N/@ J8(® in the expansion of T (D, N, J), there are f(d) nonzero-input bits. This 
means that the average number of input bits in error can be obtained by multiplying the 
probability of choosing each path by the total number of input errors that would result if 
that path were chosen. Hence, the average number of bits in error, in the soft-decision case, 


can be bounded by 
z = Sp 
Prk) < J, afd | [2Red 
=dfree 4 
1 [>e] 
2a Yo aroe rim, 
d=dfree 
If we define 
T,(D, N) = T(D, N, J)\ja1 
Cc 
= >> aD*ni®, 
d=dfree 
we have 
dT,(D, N) z d ar f(d)—1 
-a = > ag f(d) DİN l 


d=dfree 


Therefore, using Equations (13.3.16) and (13.3.18), we obtain 
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(13.3.16) 


(13.3.17) 


(13.3.18) 


(13.3.19) 
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To obtain the average number of bits in error for each input bit, we have to divide this 
bound by k. Thus, the final result is 


fas 1 07%2(D,N) 
Py = SO ; 13.3.20 
b= k ƏN -R Ob ( ) 
N=1,D=e o 
For high SNRs, the first term corresponding to the minimum distance is the dominant term, 
and we have the approximation 


= 1 
Pp ~ = dani f (dmin)e Pe min Co /N0 (13.3.21) 
For hard-decision decoding, the basic procedure follows this derivation. The only 
difference is the bound on P>(d). It can be shown that (see Problem 13.19) P2(d) can be 
bounded by 


P(d) < [4p — p)i*”, (13.3.22) 


where p is the crossover probability of the binary symmetric channel. Using this result, 
it is straightforward to show that in hard-decision decoding, the probability of error is 
upperbounded as 


1 072(D, N) 


P<- : 
k aN N=1,D=./4p0—p) 


(13.3.23) 


A comparison of hard-decision decoding and soft-decision decoding for convolutional 
codes shows that here, as in the case for linear block codes, soft-decision decoding out- 
performs hard-decision decoding by a margin of roughly 2 dB in additive white Gaussian 
noise channels. 


Convolutional Codes with Good Distance Properties. From this analysis, it 
is obvious that dfree plays a major role in the performance of convolutional codes. For a 
given n and k, the free distance of a convolutional code depends on the constraint length of 
the code. Searching for convolutional codes with good distance properties has been exten- 
sively carried out in the literature. Tables 13.1 and 13.2 summarize the result of computer 
simulations carried out for rate l and rate 1 convolutional codes. In these tables, for each 
constraint length, the convolutional code that achieves the highest free distance is tabu- 
lated. For this code, the generators g; are given in octal form. The resulting free distance 
of the code is also given in these tables. 


13.4 GOOD CODES BASED ON COMBINATION OF SIMPLE CODES 


As we have seen in the preceding sections, the performance of block and convolutional 
codes depends on the distance properties of the code and, in particular, the minimum dis- 
tance in block codes and the free distance in convolutional codes. In order to design block 
codes with a given rate and with high minimum distance, we must increase n, the block 
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TABLE 13.1 RATE 5 MAXIMUM FREE DISTANCE CODES 





Constraint Length L Generators in Octal bee 
3 5 7 5 
4 15 17 6 
5 23 35 7 
6 53 75 8 
7 133 171 10 
8 247 371 10 
9 561 753 12 

10 1167 1545 12 
11 2335 3661 14 
12 4335 5723 15 
13 10533 17661 16 


14 21675 27123 16 


Odenwalder (1970) and Larsen (1973). 


TABLE 13.2 RATE i MAXIMUM FREE DISTANCE CODES 


Constraint Length L Generators in Octal sree 
3 5 7 7 8 
4 13 15 17 10 
5 25 33 37 12 
6 47 53 75 13 
7 133 145 175 15 
8 225 331 367 16 
9 557 663 711 18 

10 1117 1365 1633 20 
11 2353 2671 3175 22 
12 4767 5723 6265 24 
13 10533 10675 17661 24 
14 21645 35661 37133 26 


Odenwalder (1970) and Larsen (1973). 


length of the code. Increasing n increases the complexity of the decoding. In most decod- 
ing algorithms, the complexity of the decoding increases exponentially with increasing the 
block length of the code. 

For convolutional codes, increasing the free distance at a given rate requires increas- 
ing the constraint length of the code. But increasing the constraint length of the code 
increases the number of states in the code trellis, which in tum increases the decoding 
complexity. Again, the decoding complexity increases exponentially with the constraint 
length of the convolutional code. 
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Various methods have been proposed to increase the effective block length of the 
code while keeping the complexity tractable. Most of these methods are based on com- 
bining simple codes to generate more complex codes. The decoding of the resulting code 
is performed by using methods for decoding the simple component codes. The resulting 
decoding is a suboptimal decoding scheme, which usually performs satisfactorily. Here, we 
discuss three widely used methods for combining simple codes to generate more complex 
codes. These techniques generate product codes, concatenated codes, and turbo codes. 


13.4.1 Product Codes 


The structure of product codes (or array codes) is very similar to a crossword puzzle. 
Product codes are generated by using two linear block codes arranged in a matrix form. 
Two linear block codes, one with parameters n; , kı, and dmin ı and another with parameters 
n2, k2, and dmin2, are used in a matrix of the form shown in Figure 13.20. The resulting 
code is an (1:2, kık2) linear block code. We can show that the minimum distance of 
the resulting code is the product of the minimum distances of the component codes, i.e., 
dmin = Amin 1dmin2, and is capable of correcting L , using optimal hard-decision 
decoding. But we can also decode using the properties of the component codes, as we 
would solve a crossword puzzle. Using the row codes, we can make the best guess for 
the bit values; then, using the column codes, we can improve these guesses. This process, 
which can be repeated in an iterative fashion, improving the quality of the guess in each 
step, is known as iterative decoding and is very similar to the way a crossword puzzle is 
solved. To employ this decoding procedure, we need decoding schemes for the row and 
column codes that are capable of providing guesses about each individual bit. In other 
words, decoding schemes with soft outputs (usually, the likelihood values) are desirable. 
In Section 13.5, we will describe such decoding procedures in our discussion of turbo 
codes. 


ky 


m- ky 


ky m=ki Figure 13.20 The structure of a product code. 
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13.4.2 Concatenated Codes 


A concatenated code consists of two codes, an inner code and an outer code, connected 
serially, as shown in Figure 13.21. The inner code is a binary block or convolutional code 
and the outer code is typically a Reed-Solomon code. If the inner code is an (n, k) code, 
the combination of the inner encoder, digital modulator, waveform channel, digital demod- 
ulator, and the inner decoder can be considered as a channel whose input and output are 
binary blocks of length k or, equivalently, elements of a q-ary alphabet where q = 2k, 
Now the Reed-Solomon code (the outer code) can be used on this q-ary input, g-ary out- 
put channel to provide further error protection. It can easily be seen that if the rates of the 
inner and the outer codes are re and Re, respectively, the rate of the concatenated code 
will be 


Rec = Rere. (13.4.1) 


Also, the minimum distance of the concatenated code is the product of the minimum dis- 
tances of the inner and the outer codes. In concatenated codes, the performance of the 
inner code has a major impact on the overall performance of the code. That is why convo- 
lutional codes with soft decoding using the Viterbi algorithm are commonly employed for 
the inner code. 


13.5 TURBO CODES AND ITERATIVE DECODING 


Shannon’s random coding theorem states that codes achieving channel capacity must have 
random, or close to random, structure as well as large block lengths. However, in gen- 
eral, for a randomly generated code, the decoding complexity grows exponentially with 
the block length of the code. Hence, lack of structure, as well as large block length, ren- 
ders maximum likelihood (ML) decoding of capacity achieving codes impractical. Turbo 
coding is a method to combine two simple codes connected by a pseudorandom interleaver 
of large length to generate a code with close-to-random structure as well as large block 
length. However, because the resulting code is based on combining simple codes, as dis- 
cussed in the preceding section, its decoding is possible by an iterative scheme based on 
the decoding of its constituent codes. This decoding method, called iterative decoding, or 
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Figure 13.21 Block diagram of a communication system with concatenated coding. 
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Figure 13.22 Encoder for parallel concatenated code (turbo code). 


turbo decoding, is not optimal but for many codes, after a few iterations, has a performance 
close to ML decoding. 

Parallel concatenated convolutional codes (PCCC) with interleaving, also called 
turbo codes, were introduced by Berrou et al. (1993). A turbo encoder, shown in Figure 
13.22, is an encoder that employs two recursive systematic convolutional encoders in par- 
allel, where the second encoder is preceded by an interleaver. The two constituent encoders 
are usually identical with rate 1/2. We observe that since information bits at the output of 
the two encoders are sent only once, the nominal rate at the output of the turbo encoder, 
before puncturing, is Re = 1/3. However, by puncturing the parity-check bits at the out- 
put of the binary convolutional encoders, we may achieve higher rates, such as 1/2 or 2/3. 
The interleaver, denoted by |], is usually selected to be a pseudorandom interleaver that 
reorders the bits in the information sequence before feeding them to the second encoder. In 
effect, the use of two recursive convolutional encoders in conjunction with the interleaver 
produces a code that contains very few code words of low weight. This characteristic does 
not necessarily imply that the free distance of the concatenated code is especially large. It, 
however, results in code words that have relatively few nearest neighbors. In other words 
the number of code words at small distance from any code word is very low, and hence, the 
code words are relatively sparse. Hence, the coding gain achieved by a turbo code is due 
in part to this feature, i.e., the reduction in the number of nearest neighboring code words, 
called the multiplicity, that results from interleaving. 

Recursive systematic convolutional codes are systematic convolutional codes, mean- 
ing the input bits appear directly as part of the encoded bits, in which the parity-check 
bits are generated by a recursive (feedback) linear filter. An example of a rate 1/2 RSCC 
is shown in Figure 13.23. In this encoder c = u; hence, it is systematic. It is also 
recursive because the parity-check bits c® are generated using a linear feedback shift 
register. 
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c®) Figure 13.23 A recursive systematic 
> convolutional encoder. 





Figure 13.24 A 27/31 RSC encoder. 


The constituent codes in turbo codes are usually described by the ratio of the octal 
representation of the feedforward connection to the octal representation of the feedback 
connection of the encoder. For example, a 27/31 RSC encoder has feedback and feedfor- 
ward connections of 31 ~ g} = (11001) and 27 ~ gy = (10111), respectively. The 
encoder is given by the block diagram shown in Figure 13.24. In this Figure c? denotes the 
stream of parity check bits generated by the encoder. Using this notation it is clear that the 
encoder of Figure 13.23 is a 5/7 recursive systematic convolutional encoder. 

The interleaver in turbo codes is usually very long, in the order of thousands of bits. 
Pseudorandom interleavers perform well, although some improvement in the performance 
of the code can be obtained by clever choice of the interleaver. Such improvement is more 
noticeable at short interleaver lengths. Also note that unlike nonrecursive convolutional 
codes in which padding a sequence of zeros to the message sequence guarantees that the 
encoder returns to the all-zero state, for RSCC’s returning the encoder to the all-zero state 
requires padding the information sequence with a particular nonzero sequence. Due to the 
existence of the interleaver, it is, in most cases, impossible to return both encoders to the 
all-zero state. 
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Example 13.5.1 


Generate the parity-check bits when the binary sequence u is encoded by the 5/7 RSCC shown 
in Figure 13.23. 


u=[01110010011001001111). 


Assume that the encoder starts from the zero state. 


Solution Passing u through the shift register starting at the all-zero state, we obtain the 
parity-check bits as 


c®=[01111110100011111100]). n 


Since turbo codes have two constituent-code components, an iterative algorithm is 
appropriate for their decoding, as discussed in Section 13.4. Any decoding method that 
yields the likelihood of the bits as its output can be used in the iterative decoding of turbo 
codes. One such decoding scheme is the maximum a posteriori probability (MAP) decoding 
method of Bahl, Cocke, Jelinek, and Raviv (BCJR), as described in Bahl et al. (1974). 
Another popular method with lower complexity (and degraded performance) is the soft- 
output Viterbi algorithm (SOVA) of Hagenauer and Hoher (1989). Using either method at 
the first decoder, the likelihoods of different information bits are computed and passed to 
the second decoder. The second decoder computes the likelihood ratios and passes them to 
the first decoder; this process is repeated until the likelihoods suggest high probability of 
correct decoding for each bit. At this point, the final decision is made. 


13.5.1 MAP Decoding of Convolutional Codes—The BCJR Algorithm 


The BCJR algorithm, named after Bahl et al. (1974), is a symbol by symbol maximum a 
posteriori (MAP) decoding algorithm for convolutional codes. In this algorithm the decoder 
uses the MAP algorithm to decode each input symbol to the decoder rather than looking 
for the the most likely input sequence. 

We assume that the set of states in the convolutional encoder is denoted by S. Because 
the encoder inputs can be either 0 or 1, transition from stage i — 1 to stage i (i.e., from 
o;-1 E S too; € S) can be either due to u; = 0 or u; = 1. Let us denote by Se the set of all 
(o;-1, 0;) pairs corresponding to u; = £ for £ = 0, 1. 

The symbol-by-symbol maximum a posteriori decoder receives the sequence y = 
(Y1; Y2, ---, Yy), the demodulator output, and based on this observation decodes u; using 
the maximum a posteriori rule 


u= P(u, 
it are nee (uily) 
EET plui, Y) 
wE p(y) 


= max 5 
arg ma plui, Y) 





(13.5.1) 


= arg max ) P(oi-1, Oi, Y), 
te{0, 1} 
(oj-1,0;)€Sg 
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where the last equality follows from the fact that u; = £ corresponds to all pairs of states 
(o;-1, Oi) € Se for £ =0, 1. 


If we define 
yi 1) = (J; stad yeD), 
(13.5.2) 
Yin = (Yii Iw) > 
we can write 
1 N 
y= (IE, yoI) (13.5.3) 
and we have 
P (i-i; Ci, Y) = p (o1, 0i, YET P, ya 9) 


(i—1) 


N 1 
= p \ i- 1 Oi, Y] 31) p (Miloi 015 9 x, yi) 


1 
= p(ai- KIF D) p (a:, yiloi- l; yË Yo (yMloia.oi, yf P, yi) 


= p (oim, yË D) p (oi, yiloim) p (yilo); 
(13.5.4) 


where the first three steps follow from the chain rule and the last step follows from Markov 
properties of the states in a trellis. 

At this point we assume the received sequence y is fixed and for this sequence we 
define three functions &œ;—1ı (o;-1), B; (Oi), and y; (o;~1, di) as 


Qi-ı (Fj-1) = p(o- n ËT ue 
Pi Gi) = p (y%lo:) P (13.5.5) 
Yi (Gi-1, 91) = p (i, Yiloi-1) - 
Using these definitions in Equation (13.5.4) we have 
P (Oi-15 Oi» Y) = Qi—1 (i-1) Yi (Gi-1, Gi) Bi i) (13.5.6) 


and hence, from Equation (13.5.1) we obtain 


aj =argmax DP a1 (1-1) Yi Cii, o) Bi (1). (13.5.7) 
(oj-1,0j)ESe 


Equation (13.5.7) indicates that for maximum a posteriori decoding we need the values of 
Qi—ı (Gi-1), Bi (Gi), and yi (Fi-1, 9%). 


The Forward Recursion for œ; (o;): We show that a; (o;) can be obtained using a forward 
recursion of the form 


ai= $, vi Ci- o) ai (0i), 1Si<N. (13.5.8) 


oj-1€S 
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To prove Equation (13.5.8) we use the following set of relations: 
_ © 
ai (o;) = p (ox, Ji ) 


1 
= 5 p(o- 1,0;, YË yi) 


oj-1€S 


1 
= X p(o- yer D) p (oi, yiliz, yf ’) 


Z (13.5.9) 
= -1) 
= D p(o- Ly )p (oi, yiloi-1) 

oj-1€S 
= ye Qi—1 (01-1) Yi (Fi-1, 9%) 5 

oj-1€S 


which completes the proof of the forward recursion relation for œ; (o;). This relation means 
that, if we have the values of y; (o;-1, o;i), it is possible to obtain a; (o;) from aj-1 (j-1). 
If we assume that the trellis starts in the all-zero state, the initial condition for the forward 
recursion becomes 


1 o=0 


(13.5.10) 
0 040 


æo (59) = P (09) = | 


Equations (13.5.8) and (13.5.10) provide a complete set of recursions for computing the 
values of œ. 


The Backward Recursion for £; (0;): The backward recursion for computing the values 
of $ is given by 


Bi-1 Gi-1) = DB; (i) Vi(Gi-1.01),. 1Si SN. (13.5.11) 


oj€S 


To prove this recursion we note that 
N 
Bi-1 Gi-1) = p (9 : loi ) 


= (N) 
= >: P (yi. Jit i loi-1) 


oj¢€S 
= Lp (0i yiloi-1) p (ISA lov. yi 0-1) (13.5.12) 
= > p (oi, Ji loi-1) P (e lo: ) 

oj€S 


= D Vi (Ti-1, 0;) B; (aj) - 


Oj eS 


734 Coding for Reliable Communications Chapter 13 


The boundary condition for the backward recursion, assuming that the trellis is ter- 
minated in the all-zero state, is 


1 on =0 


; 13.5. 
ee (13.5.13) 


By (on) = | 

The recursive relations (13.5.8) and (13.5.11) together with initial conditions 

(13.5.10) and (13.5.13) provide the necessary equations to determine œ’s and B’s when 
y’s are known. We now focus on computation of y’s. 


Computing y; (o;_1, oi): We can write yi (0;-1, 0;),1 <i < N, as 
Vi (Gi-1, 01) = p (0%, y;loi-1) 
= p (iloi-1) p (yildi, 01-1) 
= P(ui)p (yilui) 
= P(ui)p (y;lci) . 


where we have used the fact that there exists a one-to-one correspondence between a pair 
of states (oj_1, oj) and the input u;. The above expression clearly shows the dependence of 
Yi (Ci-1, 04) on P(u;), the prior probability of the information sequence at time i, as well 
as p ( Ji Ici) (which depends on the channel characteristics.) If the information sequence is 
equiprobable, an assumption that is usually made when no information is available, then 
P(u; = 0) = P (u; = 1) = L. Obviously, the above derivation is based on the assumption 
that the state pair (0;-1, 0;) is a valid pair, i.e., a transition from g;—1 to o; is possible. 

Equation (13.5.7) together with the forward and backward relations for œ and £ given 
in Equations (13.5.8) and (13.5.11) and Equation (13.5.14) for y are known as the BCJR 
algorithm for symbol-by-symbol MAP decoding of a convolutional code. 

Note that unlike the Viterbi algorithm that looks for the most likely information 
sequence, the BCJR finds the most likely individual bits. The BCJR algorithm also provides 
the values of P (u;|y). These values provide a level of certainty of the decoder about the 
value of u; and are called soft outputs or soft values. Having P (ui|y), we can find the a 
posteriori likelihood values (L-values) as 


P (uy; = 1|y) 
P (uy; = Oly) 
= mi (ui = 1, y) 
P (ui = 0, y) 
D ai-1 (01-1) Yi Gi-1, 0%) Bi i) 
(6; -1,0)ES] 


D ai—1 (;-1) Vi (Ci-1, 0%) Pi (oi) 


(o;-1,01) €So 


(13.5.14) 


L(u;) = In 


(13.5.15) 


=In 


which are also referred as soft outputs. Knowledge of soft outputs is crucial in decoding of 
turbo codes discussed later in this chapter. A decoder such as the BCJR decoder that accepts 
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soft inputs (the vector y) and generates soft outputs is called a soft-input-soft-output (SISO) 
decoder. Note that the decoding rule based on L(u;) soft values is given by 


(13.5.16) 


‘10 Lj) <0 


For an AWGN channel we have y = c + n, where c represents the modulated signal 
_ corresponding to the encoded sequence. In this channel model 


P (u;i) ly; — cil? 
Yi (Ci-1, Oj) = TMy exp (2L : (13.5.17) 


For the special case when n = 2, the convolutional code is systematic, and the 
modulation is BPSK, direct substitution results in 


2 2 
1 yi) +O) +28 2ye? + 2yPe? 
Vi (Oj-1, 0i) = No exp | ( ) be) P(u;) exp ( Yi i Yi Ci f 





(13.5.18) 


where superscripts s and p indicate systematic and parity-check components of y and c. 
Direct substitution of Equation (13.5.18) into Equation (13.5.15) yields 


2y? 
z D aao) exp (ES ee a (i) 
Avey; In Plu; = 1 ey cess 
Sry SO aiii) exp (ia =) (1) 


(Ti —1,0;)E€S0 





L(u;) = 


(13.5.19) 


Equation (13.5.19) shows that the log likelihood ratio (LLR) of the information bits - 
is the sum of three terms: the first term that depends only on the systematic bits, the second 
term that depends only on the prior probabilities, and the third term that depends on the 
parity-check bits. 

One problem with the version of the BCJR algorithm described above is that it is not 
a numerically stable algorithm, particularly for long trellises. An alternative to this algo- 
rithm is the log-domain version of it, known as the log-APP (log a posteriori probability) 
algorithm or the log-MAP algorithm. 

In the log-APP algorithm, instead of œ, 6, and y, we define their logarithms as 


Či (0) = In (a; (04) , 


Bi (61) = In (6 (3), (13.5.20) 
Fi (Ti—1, 01) = In (yi (i-1, 0;)) . 
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Straightforward calculation shows that the following forward and backward recursions 
hold for @ (o;) and f; (o;-1): 


či) =In| D> exp @-1 i-1) + % Gi-1,.1)) | , 


oj-1€S 
(13.5.21) 
B;-1 (@i-1) = In D exp (B; (01) + P (i-1, 01)) 
oj€S 
with initial conditions 
mo 0 090 = 0 ~ 0 ON = 0 
00) = on) = ; 13.5.2 
ag (c0) ae nee Bn (on) G MES ( 2) 
and the a posteriori L-values are computed as 
Lu=m| XO exp (či i-1) +% Gi-1, 01) + Bi (01)) 
(oj-1,0))€S) 
(13.5.23) 


~In D exp (či-1 (0i-1) + 7; (Gi-1, 01) + Bi (0;)) 


(o;-1,0;)€Sp 


These relations are numerically more stable but are not computationally efficient. In 
order to improve the computational efficiency, we can introduce the following notation: 


max*{x, y} £ ln (e + e) ; 


(13.5.24) 
max*{x, y, z} £ In (e* +e + e) P 
Using these definitions, we have the recursions 
a; (Oi) = max" {@i-1 (Fi-1) + Fi (Fi-1, O%1)}, 
Gi-1 
(13.5.25) 


Bi-1 (Gi-1) = max" {Bi (i) + ¥; @i-1,91)} 


where the initial conditions for these recursions are given by Equation (13.5.22). The a 
posteriori L-values are given by 


L(uj) = max* — {@-1 (61-1) + Fi Gi-1, %) + Bi €} 
(1-1.0;) €S1 


— max {či (j-1) + Pi Gi-1, 01) + Bi (ai)} - 
(941.0; ) €S0 


(13.5.26) 


The initial conditions for these recursions are given by Equation (13.5.22). 
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For binary constituent codes, BPSK modulation, and AWGN channel the expres- 
sion for the a posteriori L-values can be obtained using the log-domain quantities in 
Equation (13.5.19). The result is 














4/6 y? e 
Low) = 8% 4 requ) + ma | i us Ricco} 
No (9;-1.0;) €S1 
(13.5.27) 
* ~ Qype? 2. 
— max aj—1 (Gi-1) + + + A: i)}, 
(o)-1,0; )€S0 No 
where we have defined L° (u;) as 
l 
P(u; = 1) 
L° (u;i) = In ———_.. 13.5.28 
(ui) =n TET (13.5.28) 
It is seen that in this case the a posteriori L-values can be written as the sum of 





three terms. The first term, wy i , depends on the channel output corresponding to the 
systematic bits received by the decoder. The second term, L° (u;), depends on the a priori 
probabilities of the information bits. The remaining terms denote the contribution of the 
channel outputs corresponding to the parity bits. 

It can be easily shown that (see Problem 13.22) 


max*{x, y} = max{x, y} + ln (1 + e kyl) l 
(13.5.29) 
max*{x, y, z} = max* {max*{x, y}, z} ; 


The term In (1 +e ') is small when x and y are not close. Its maximum occurs when 
x = y, for which this term is In 2. It is clear that for large x and y or when x and y are not 
close, we can use the approximation 


max*{x, y} © max{x, y}. (13.5.30) 
Under similar conditions we can use the approximation 
max*{x, y, z} © max{x, y, z}. (13.5.31) 


The approximate relations in Equations (13.5.30) and (13.5.31) are valid when the 
the values of x and y (or x, y, and z) are not close. In general, approximating max* by 
max in Equation (13.5.25) would result in a small performance degradation. The resulting 
algorithm, which is a suboptimal implementation of the MAP algorithm, is called that 
max-Log-APP algorithm (or, max-log-MAP algorithm) 

Instead of using the approximations given in Equations (13.5.30) and (13.5.31), we 
can use a lookup table for values of the correction term In (1 +e hy l) to speed up the 
algorithm. 


13.5.2 Iterative Decoding for Turbo Codes 


We have seen that optimal decoding of turbo codes is impossible due to the large num- 
ber of states in the code trellis. A suboptimal iterative decoding algorithm, known as the 


738 Coding for Reliable Communications Chapter 13 


turbo decoding algorithm, was proposed by Berrou et al. (1993) and achieves excellent 
performance very close to the theoretical bound predicted by Shannon. 

The turbo decoding algorithm is based on iterative usage of the log-APP or the max- 
log-APP algorithm. As is seen from Equation (13.5.27), the LLR in the case, a rate 1/2 
RSCC can be written as the sum of three terms as 





Lu) = Ley} + LO (uj) + LO (ui), (13.5.32) 
where 
re 4J/Eys 
cyi = No 
P(u; = 1) 
L° (ui 
yarali Su = 0) 
2yPe! p (13.5.33) 
LO (u) = max“ fa- 07-1) + + + B; (i | 
(uj) Pe 1 (Oj-1) No B: ( 


2yPci P 


a * 
max fa- 1 (Oj-1) + —— No 


(41-101 )€5Sp 


+ Bi ca} 


and we have defined L, = ve. 

The term Ley} is called the channel L-value and denotes the effect of channel out- 
puts corresponding to the systematic bits. The second term L? (u;) is the a priori L-value 
and is a function of the a priori probabilities of the information sequence. The final term, 
L® (ui), represents the extrinsic L-value or extrinsic information that is the part of the a 
posteriori L-value and that does not depend on the a priori probabilities and the systematic 
information at the channel output. 

Let us assume that the binary information sequence u = (u1, u2, .. . , Un) is applied 
to the first RSCC with rate 1/2 and let us denote the parity bits at the output by c? = 
(c?, OSANA cy): The information sequence is passed through the interleaver to obtain 
u' = (u4, Uh, ..., Uy), this sequence is then applied to the second encoder to generate 
the parity sequence c’? = = (cP ote santa cas Sequences u, cP, and c’? are BPSK modu- 
lated and transmitted over a Gaussian channel. The corresponding output sequences are 
denoted by y’, yP”, and y’?. The MAP decoder for the first constituent code receives the 
pair (y5, yP). In the first iteration the decoder assumes all bits are equiprobable and there- 
fore the a priori L-values are set to zero. Having access to (y°, yP), the first decoder uses 
Equation (13.5.27) to compute the a posteriori L-values. At the output of the first con- 
stituent decoder, the decoder subtracts the channel L-values from ne és posteriori 
L-values to compute the extrinsic L-values. These values are denoted by LE Y Cu; ) and are 
permuted by the interleaver [] and then used by the second constituent decoder as its a 
priori L-values. In addition to this information, the second decoder is also supplied with 
y” and a permuted version of y* after passing it through the interleaver ||. The second 
decoder computes the extrinsic L-values denoted by LES (ui) and after permuting them 
through E supplies them to the first decoder, which in the next iteration uses these val- 
ues as its a priori L-values. This process is continued for a fixed number of iterations, or 
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Figure 13.25 Block diagram of a turbo decoder. 


until a certain criterion is met. After the last iteration, the a posteriori L-values L(u;) are 
used to make the final decision. 

The building block of the turbo decoder is a SISO decoder with inputs y*, y?, and 
L® (ui) and outputs L©(u;) and L(u;). In iterative decoding L (u;) is substituted by the 
extrinsic L-values provided by the other decoder. The block diagram of a turbo decoder is 
shown in Figure 13.25. 


13.5.3 Performance of Turbo Codes 


Turbo codes are characterized by excellent performance at low SNR’s. The performance 
of turbo codes improves with increasing the length of the interleaver and the number of 
iterations. The original turbo code studied by Berrou et al. (1993) used the 21/37 recur- 
sive systematic convolutional encoder shown in Figure 13.26. This code was used with an 
interleaver with a constraint length of N = 2!6 = 65536 and puncturing was employed to 
increase its rate to 1/2. The performance of the resulting code using the BCJR decoding 
algorithm is shown in Figure 13.27. After 18 iterations this code achieves an error proba- 
bility of 1075 at an SNR of 0.7 dB, this is only 0.5 dB away from the Shannon limit for a 
rate 1/2 code. 

One problem with the turbo codes is the existence of the error floor. As seen in 
Figure 13.27, the probability of error decreases sharply with increasing G,/No up to a 





Figure 13.26 The 21/37 recursive systematic convolutional code. 
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certain point. After this point, the error probability decreases very slowly. The existence 
of the error floor is a consequence of the distance properties of turbo codes. Its turns out 
that, although turbo codes have excellent performance at low signal-to-noise ratios, they 
have rather poor minimum distance. The reason they can perform well is that, although the 
distance properties are poor, the number of paths at low distance (called the multiplicity of 
that distance) is very low. In ordinary convolutional codes, we can design codes with much 
better minimum distance, but much higher multiplicity at low distances. Now referring to 
Equation (13.3.21), we see that for low SNR’s, the multiplicity (aa n) has a higher impact 
on the performance of the code, whereas at high SNR’s, the minimum distance of the code 
plays a major role; thus, the performance of turbo codes at high SNR’s sharply degrades. 

Figure 13.28 compares the performance of the 21/37 turbo code with a rate 1/2 con- 
volutional code with constraint length 14, and soft-decision Viterbi decoding. In the same 
plot, performance bounds (using the union bound) for both codes are also plotted. 


13.6 LOW-DENSITY PARITY-CHECK CODES 


Low-density parity-check codes (LDPC) are linear block codes that are characterized by 
a sparse parity-check matrix. These codes were originally introduced in Gallager (1960, 
1963) but were not widely studied for the next 20 years. These codes have been the topic of 
active research in the coding community motivated by their excellent performance, which 
is realized by using an iterative decoding scheme known as the sum-product algorithm. 
In fact, it has been shown that these codes are competitors to turbo codes in terms of 
performance and, if well designed, have better performance than turbo codes. Their excel- 
lent performance has resulted in their adoption in several communication and broadcasting 
standards. 

Low-density parity-check codes are linear block codes with very large code word 
length n usually in the thousands. The parity-check matrix H for these codes is a large 
matrix with very few 1’s in it. The term low density refers to the low density of 1’s in the 
parity-check matrix of these codes. 

A regular low-density parity-check code can be defined as a linear block code with 
a sparse m x n parity-check matrix H satisfying the following properties. 


1. There are w, 1’s ineachrow of H, where w, < min{m, n}. 
2. There are we 1’s in each column of H, where we < min{m, n}. 


The density of a low-density parity-check code, denoted by r, is defined as the ratio of the 
total number of 1’s in H to the total number of elements in H. The density is given by 


r=—>=>— (13.6.1) 


from which it is clear that 


=—, (13.6.2) 
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If the matrix H is full rank, then m = n — k 


Ra Sa (13.6.3) 
n Ww, 
otherwise, 
rank(H 
R:=1— =. (13.6.4) 


Low-density parity-check codes are conveniently represented by a graph representa- 
tion known as the Tanner graph. Tanner graphs can be used to represent any linear block 
code. The Tanner graph is a graphical representation of cH‘ = 0 and can be obtained by 
representing each code word component cj, 1 < i < n, as a node i, known as a variable 
node and shown by a circle, and each of the n — k constraints given by cH’ = 0 as anode 
j, 1 < j < n — k, known as a check node and shown by a square. Variable node i and 
check node j are connected by an edge if c; appears in the jth parity-check equation. The 
degree of each node is the number of edges connected to it. 

Figure13.29 depicts the Tanner graph for a (7, 4) Hamming code. The parity-check 
equations for this code are (see Example 13.2.4) 


fi: cı +c2+c3+cs =0, 
fa: c2+c3+c4+ c= 0, (13.6.5) 
fz: Cy +c + c4 +c = 0. 


The Tanner graph shown in Figure 13.29 does not represent a regular LDPC code, since 
neither the degrees of the check nodes are equal, nor the degrees of the variable nodes. 
Also note that the graph shown in Figure 13.29 includes cycles—that is, a path on the 
edges that starts from a node and ends in the same node. The length of the shortest cycle 
in a graph is called the girth of the graph. The girth of the graph shown in Figure 13.29 is 
4 (corresponding to the cycle ca > fi > cz > fz > ci). 

In the Tanner graph of Figure 13.29 the variable nodes, which correspond to the 
variables supplied to the Tanner graph, are denoted by circles on the left; and the check 
nodes or constraint nodes, which force a relation between the variables. These nodes are 
denoted by squares on the right. A binary sequence c is a code word if it satisfies the three 
constraints given by Equation (13.6.5). We also note that in this graph edges can only exists 
between a variable node and a check node, i.e., all edges connect a node on the left to a 
node on the right. Graphs with this property are called bipartite graphs. 

Let us define the indicator function of a proposition P as 


wee Le See (13.6.6) 
0 if P is false 


then, for instance, c is a code word of a Hamming code if the three equalities in Equation 
(13.6.5) are satisfied. With the notation just introduced in Equation (13.6.6), this means 
that c is a code word if and only if 


ô[cı +c2 +03 + cs = 0]ê[c2 + c3 + c4 + ce = O)8[c) +2 +04 +c = 0) = 1. (13.6.7) 
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Figure 13.29 The Tanner graph for the (7, 4) Hamming code. 


The graph shown in Figure 13.29 is a graphical representation of Equation (13.6.7). 

The Tanner graph of a regular low-density parity-check code consists of the usual 
constraint and variable nodes. The low-density and regular constraint of the code, however, 
makes the degree of all constraint (parity-check) nodes equal to w, which is much less than 
the code block length. Similarly, the degree of all variable nodes is equal to w,. The Tanner 
graph for a regular LDPC code with w, = 3 and we = 4 is shown in Figure 13.30. 

The Tanner graph of LDPC codes is usually a graph with cycles. We have previously 
defined the girth of a graph as the length of the shortest cycle in that graph. Obviously a 
Tanner graph with cycles has a girth that is least equal to 4. A common decoding technique 
used for LDPC codes is the sum-product algorithm. This algorithm is effective when the 
girth of the Tanner graph of the LDPC code is large. The reason for this behavior is that 
in order for the sum-product algorithm to be effective on a graph with cycles, the value 
of the extrinsic information, that is the information contributed by other variable nodes to 
the decoding of a certain node, must be high. If the girth of the LDPC code is low, the 
information corresponding to a bit loops back to itself very soon, hence providing a little 
amount of extrinsic information and resulting in poor performance. Design techniques for 
LDPC codes with large girth are topics of active research. 
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Figure 13.30 The Tanner graph for a regular LDPC code with w, = 4 and w, = 3. 


The low-density parity check codes descibed so far are regular LDPC codes. Irreg- 
ular LDPC codes are a generalized class of low-density parity check codes in which the 
number of 1’s in rows and columns is variable, but the overall number of 1’s in H is 
low. This added flexibility in the structure of irregular LDPC codes makes it possible to 
optimize the distribution of 1’s in rows and columns, thus improving the performance. In 
general, irregular LDPC codes perform better than regular LDPC codes. Irregular LDPC 
codes are among the most effective coding schemes that can achieve performance within a 
fraction of dB from channel capacity. 


Example 13.6.1 
The parity check matrix for a (12, 3) regular LDPC code is given in Equation (13.6.8). 
001001110000 
1 10010000001 
00010000 1 110 
010001100100 
H=|1 01000010010 (13.6.8) 
00011000 1001 
1 00110100000 
0000010100 1 1 
011000003110 «0 


Determine we, w,, and the density of this code. Write the parity check equations and draw the 
Tanner graph of the code. What is the girth of this code? 


Solution Each row of the parity check matrix contains four 1’s and each column of it con- 
tains three 1’s. Therefore w, = 3 and w; = 4. The density of the code denoted by r is 
given by 


(13.6.9) 
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If ¢ = (cy, ©, .. . , Cn) is a code word, then cH’ = 0. This relation yields a total of 9 
parity-check equations, each containing we = 4 of the c;’s. The parity-check equations are 
given below 


c3 ® c6 ® C7 Og = 0, 
cı BC2 OCs cn =O, 
c4 ® co OC Pei =O, 
c2 B c6 ® c7 cy = O, 
C1 B C3 ® cs PCy = 0, 
c4 B c5 Beg PCr = O, 
cı ® c4 BCs C7 = O, 
c6 O cs OC cig = 0, 
c2 B c3 ® co Bein =O. 


Note that each c;, for 1 < i < n, appears in exactly w, = 3 of the nine equations. The Tanner 
graph of the code is shown in Figure 13.31. From the Tanner graph it can be easily verified 
that the girth of this code is 4 (for instance, there is path of length 4 that starts at c4 and returns 
to c4 after passing through Cs). | 


13.6.1 Decoding LDPC Codes 


In this section we describe two algorithms for decoding LDPC codes. These algorithms 
are the bit-flipping algorithm and the sum-product algorithm, the latter also referred to as 
the belief propagation algorithm. The bit-flipping algorithm is a hard-decision decoding 
algorithm with low complexity. The sum-product algorithm is a soft-decision algorithm 
with higher complexity. 


The Bit-Flipping Algorithm. The bit-flipping algorithm is a hard-decision 
decoding algorithm. Let us assume that y is the hard channel output (i.e., the channel output 
quantized to 0 or 1.) In the first step of the bit-flipping algorithm, the syndrome s = y H' 
is computed. If the syndrome is zero, then we put ¢ = y and stop. Otherwise, we consider 





Figure 13.31 The Tanner graph for the LDPC code with H given in Equation (13.6.8). 
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the nonzero components of s. Each of these -nonzero components corresponds to a parity- 
check equation that is not satisfied. We then update y by flipping those components of 
y for which the number of unsatisfied check equations exceeds a certain threshold. After 
the update, the syndrome is computed again and the whole process is repeated for a fixed 
number of iterations or until the syndrome is equal to zero. The optimal value of the thresh- 
old in the bit-flipping algorithm has been computed by Gallager (1963). 

A modified, and much simpler, version of the bit-flipping algorithm is obtained by 
flipping only those bits in y for which the number of unsatisfied parity check equations has 
the largest value among all components of y and then repeating the syndrome computation. 
This process is continued until either the syndrome is zero or a predetermined number of 
iterations is reached. The interested reader can refer to Lin and Costello (2005) for more 
details on bit flipping decoding and its various variations. 


The Sum-Product Algorithm. The sum-product algorithm, which belongs to 
the class of message-passing algorithms, is an iterative decoding algorithm for LDPC 
codes that is based on passing likelihood ratios between variable and check nodes. The 
variable nodes receive the outputs of the channel and pass the likelihoods of code word 
components to the check nodes. Each check node updates and transmits the likelihood of 
each bit to the corresponding variable node by using the received likelihoods from all other 
variable nodes that are connected to it (extrinsic information). This process is repeated 
until a predetermined maximum number of iterations is achieved or until a code word is 
decoded,i.e., all check equations are satisfied. Here we provide only the steps in the sum- 
product algorithm for BPSK modulation over AWGN channels. The interested reader can 
refer to Ryan and Lin (2009) for details. 

For an AWGN channel with BPSK modulation, where 0 is mapped to V€ and 1 is 
mapped to —4J/$, when the ith channel output is y;, the corresponding likelihood ratio is 
given by 


)=In P(yi0) 
pall) 
2 
-(»-v€) /No 
e 


=In GA) ie (13.6.10) 


L(y 


The sum-product algorithm is initialized at each node i, 1 <i < n, by sending the 
likelihood values in Equation (13.6.10) to all check nodes j € M(i), where M(i) denotes 
the set of check nodes connected to the variable node i. In other words, for all 1 < i < n, 
and for all j € M (i), the message passed from node i to node j is 


Li>j = Li = — jj. (13.6.11) 


Section 13.7 Coding for Bandwidth-Constrained Channels . 747 


Check node j, after receiving all messages from the variable nodes connected to it, 
computes the message to be sent to node i, 1 <i < n, using the relation 


- 1 
Lj; =2tanh™' | I] tanh (3t) ; (13.6.12) 
VEN) {i} 


where N (j) denotes the set of variable nodes connected to check node j. This is done for 
alli e N(j). 

In the next step, the variable nodes update their information based on the received 
information from the check nodes. In this step, variable node i sends check node j € M(i) 
the updated likelihood 


Lisg= lit Do Lyi (13.6.13) 
j’EM(i)—-(j} 


Equations (13.6.12) and (13.6.13) are iteratively computed and the updated likeli- 
hoods are passed between check and variable nodes until either a preset number of itera- 
tions is achieved or a code word is detected. The detection is based on computing the total 
likelihoods at the variable nodes using 


| = Li+ D Eisi (13.6.14) 
jeM(i) 


and then detecting ¢ using 


M 1, if Liotal 0 
— Sas gate (13.6.15) 
0, otherwise 


13.7 CODING FOR BANDWIDTH-CONSTRAINED CHANNELS 


In the two major classes of codes studied so far, i.e., block and convolutional codes, an 
improvement in the performance of the communication system is achieved by expanding 
bandwidth. In both cases, the Euclidean distance between the transmitted coded waveforms 
is increased by use of coding, but at the same time, the bandwidth is increased by a factor of 
i= re These codes have wide applications in cases where there is enough bandwidth and 
the communication system designer is not under tight bandwidth constraints. An example 
of such a case is a deep-space communication system. However, in many practical applica- 
tions, we are dealing with communication channels with strict bandwidth constraints and 
the bandwidth expansion due to coding may not be acceptable. For example, in the trans- 
mission of digital data over telephone channels (modem design), we are dealing with a 
restricted-bandwidth channel, and the overhead due to coding imposes a major restriction 
on the transmission rate. In this section, we will discuss an integrated coding and mod- 
ulation scheme called trellis-coded modulation that is particularly useful for bandwidth- 
constrained channels. 
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13.7.1 Combined Coding and Modulation 


Use of block or convolutional codes introduces redundancy that, in turn, causes increased 
Euclidean distance between the coded waveforms. On the other hand, the dimensionality 
of the transmitted signal waveforms, when binary PSK modulation is employed, increases 
from k to n. This increase in dimensionality results in an increase in bandwidth since band- 
width and dimensionality are proportional, as seen from Equation (9.7.5). If we want to 
reap the benefits of coding and, at the same time, not increase the bandwidth, we have to use 
a modulation scheme other than binary BPSK, i.e., a scheme that is more bandwidth effi- 
cient. This means that we have to employ a multilevel/multiphase-modulation scheme to 
reduce the bandwidth. Of course using a multilevel/multiphase-modulation scheme results 
in a more “crowded” constellation and, at a constant power level, decreases the minimum 
Euclidean distance within the constellation. This certainly has a negative effect on the 
error performance of the overall coding-modulation scheme. But, as we will see next, this 
reduction of the minimum Euclidean distance within the constellation can be well compen- 
sated by an increase in the distance due to coding such that the overall performance shows 
considerable improvement. 

As an example, assume that, in the coding stage, we want to use a rate Z code. If the 
rate of the source is R bits/sec, the number of encoder output binary symbols/sec will be 
2R. If we want to use a constellation such that the bandwidth requirement is equal to the 
bandwidth requirement of the uncoded signal (no bandwidth expansion), we must assign m 
dimensions for each output binary symbol such that the resulting number of dimensions/sec 
is equal the number of dimensions/sec of the uncoded data, which is R. Therefore, we must 
have 


3 
R= km: (13.7.1) 
hence, 
2 
m= dimension/binary symbol. (13.7.2) 


3 


This means that the constellation should be designed so we have two dimensions for every 
three binary symbols. But three binary symbols are equivalent to eight points in the con- 
stellation; therefore, we can achieve our goal with an eight-point constellation in the two- 
dimensional space. One such constellation is, of course, an 8-PSK modulation scheme. 
Therefore, if we use a rate Z code in conjunction with an 8-PSK modulation scheme, there 
will be no bandwidth expansion. 

Now, examine how much coding gain we can obtain from such a scheme. Assuming 
that the available power is P with no coding, we have 


b= (13.7.3) 


The minimum squared Euclidean distance between two uncoded sequences is d? = 


(£2 /8)?, or 
ga (13.7.4) 
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Figure 13.32 The 8-PSK constellation used for 
bandwidth-efficient coding. 





If two information bits are mapped into a point in an 8-PSK constellation, the energy of 
this point is 

2P 
K 
From this, we can derive an expression for the minimum Euclidean distance within the 
constellation (see Figure 13.32) as 


És = (13.7.5) 


2P e aT P 
= a. sin? z722- 2 = (13.7.6) 
Obviously, the minimum Euclidean distance has been decreased. To see this effect, we 
derive the loss due to using this constellation: 


a. 2 
džin aA 


This loss has to be compensated by the code. Of course, the rate Z code employed here 
should not only compensate for this loss, but should also provide additional gain to justify 
its use of the overall coding-modulation scheme. We can use any block or convolutional 
code that can provide the minimum distance required to achieve a certain overall coding 
gain. For example, if we need an overall coding gain of 3 dB, the code must provide a cod- 
ing gain of 8.33 dB to compensate for the 5.33-dB loss due to modulation and to provide 
an extra 3-dB coding gain. A code that can provide such a high coding gain is a very com- 
plex code requiring a sophisticated encoder and decoder. However, by interpreting coding 
and modulation as a single entity, as shown in Section 13.7.2, we see that a comparable 
performance can be achieved using a much simpler coding scheme. 


=2 + v2 = 3.141 ~ 5.33 dB. (13.7.7) 








13.7.2 Trellis-Coded Modulation 


Trellis-coded modulation, or TCM, is a simple method for designing coded-modulation 
schemes that can achieve good overall performance. This coding-modulation scheme is 
based on the concept of mapping by set partitioning developed by Ungerboeck (1982). 
Mapping by set partitioning can be used in conjunction with both block and convolutional 
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codes, but due to the existence of a simple optimal soft-decision decoding algorithm for 
convolutional codes (the Viterbi algorithm), it has been mostly used with convolutional 
codes. 


Set Partitioning Principles. The key point in the partitioning of a constellation 
is to find subsets of it that are similar and the points inside each partition that are maximally 
separated. Starting with the original constellation, we partition it into two subsets that are 
congruent and ensure that the points within each partition are maximally apart. Then, we 
apply the same principle to each partition and continue. The point at which the partitioning 
is stopped depends on the code that we are using. This will be discussed shortly. 

An example of set partitioning is shown in Figure 13.33. We start with an 8-PSK 
constellation with power €s. The minimum distance within this constellation is 


dy = V (2—V2)%. (13.7.8) 


This constellation is partitioned into two subsets denoted by Bo and B,. Note that Bo and 
B; are congruent. There are many ways that the original 8-PSK constellation can be par- 
titioned into two congruent subsets, but Bo and B, provide the maximum intra-partition 
distance. This distance is easily seen to be 


dy = V2%s. (13.7.9) 


We further partition Bo and B; to obtain Co, C1, C2, and C3. As a result, the intra partition 
distance increases to 


dp = WE. (13.7.10) 


We can still go one step further to obtain eight partitions, each containing a single point. 
The corresponding subsets are denoted by Do through D7. Another example of set parti- 
tioning applied to a QAM constellation is given in Figure 13.34. This partitioning follows 
the general rules for set partitioning as just described. 


Coded Modulation. The block diagram of a coded-modulation scheme is shown 
in Figure 13.35. A block of length k information bits is broken into two subblocks of 
lengths kı and kp, k = ky + k2. The first kı bits are applied to an (71, kı) binary encoder. 
The output of the encoder consists of nı bits. These bits are used to choose one of 2”! 
subsets in the partitioning of the constellation. After the subset is chosen, the remain- 
ing k2 bits are used to choose one of the points in the selected subset. This means that 
there exist 2% points in each subset. Therefore, the constellation that is used contains 2”! 
subsets and each subset contains 2%? points. This gives us a rule for how large a con- 
stellation is required and how many steps in partitioning of this constellation must be 
taken. 

Ungerboeck (1982) has shown that, by choosing nı = kı + 1 and kp = 1 and using 
simple convolutional codes, we can design coded modulation schemes that achieve an over- 
all coding gain between 3 and 6 dB. One such scheme is shown in Figure 13.36. In this 
coding scheme, kı = 1, nı = 2, and k, = 1. The constellation contains 2”'+* = 8 points, 
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Figure 13.33 Partitioning of an 8-PSK constellation. 


which are partitioned into 2”! =4 subsets, each containing 2% = 2 points. The constella- 
tion chosen here is an 8-PSK constellation, and it is partitioned as shown in Figure 13.33. 
The convolutional code employed can be any rate = = 5 code. The constraint length 
of this code is a design parameter and can be chosen to provide the desired coding gain. 
Higher constraint lengths, of course, provide higher coding gains at the price of increased 
encoder--decoder complexity. In this very simple example, the constraint length has been 
chosen to be equal to 3. The (one-stage) trellis diagram of this code is also shown in 
Figure 13.36. 

The trellis diagram shown in Figure 13.36 is similar to the trellis diagram of an 


ordinary convolutional code. The main difference is that in this trellis we have parallel 
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Figure 13.34 Set partitioning of a 


16-QAM constellation. 
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Figure 13.36 A simple TCM scheme. 
paths (transitions). The reason for this is the existence of the extra k = 1 bit, which 


chooses a point in each subset. The two parallel paths connecting two states correspond to 
a subset, and any single path corresponds to a point inside the subset. One final question 
remains to be answered: What is the optimal assignment of the constellation points to the 
branches of the code trellis? Extensive computer simulations as well as heuristic reasoning 
result in the following rules: 


1. Parallel transitions, when they occur, correspond to signal points in a single subset at 
the last stage of partitioning (thus providing the largest Euclidean distance). In this 
example, Co = {Do, D4}, C2 = {D2, Do}, Cy = { D1, Ds}, and C3 = {D3, D7} cor- 
respond to parallel transitions. These points are separated by the maximum Euclidean 
distance of d) = 2./€;. 
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2. Transitions that have only one common end (i.e., they either originate from the same 
state, or end in the same state) correspond to subsets, in the last stage of partitioning, 
that come from a common parent subset in the preceding stage of set partituioning. 
In this example, {Co, C2} and {C1, C3} are such subsets, having parents Bo and By, 
respectively. The maximum distance in this case is dı = /2%;. 


3. The signal points should occur with equal frequency. 


To see how the trellis-coded modulation scheme of Figure 13.36 performs, we have 
to find the minimum Euclidean distance between two paths originating from a node and 
merging into another node. This distance, known as the free Euclidean distance and denoted 
by Drea, is an important characteristic of a trellis-coded modulation scheme. One obvi- 
ous candidate for Djeg is the Euclidean distance between two parallel transitions. The 
Euclidean distance between two parallel transitions is d} = 2./6,. Another candidate path 
is shown in Figure 13.37. However, the Euclidean distance between these two paths is 
d= a + 2d? = 4.58€,. Obviously, this is larger than the distance between two parallel 
transitions. It is easily verified that for this code, the free Euclidean distance is Dfe = d} = 
2/€,.To compare this result with an uncoded scheme, we note that in an uncoded scheme 
(see Equation 13.7.4), 

z 


uncode 


P 
a = 48, = 4, (13.7.11) 
R 
and in the coded scheme (see Equation 13.7.5), 
denies = 485 = 885. (13.7.12) 


Therefore, the coding gain is given by 


da 
Gooding = 28 = 2 ~ 3 dB. (13.7.13) 


uncoded 


C, 
(2,6) 


C 
(3,7 





Figure 13.37 Two candidate minimum distance paths. 
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Figure 13.38 An eight-state 
Ungerboeck encoder. 








D,D3DsD, 


Thus, this simple coding scheme is capable of achieving a 3-dB coding gain without 
increasing bandwidth. Of course, the price paid for this better performance is increased 
complexity in encoding and decoding. 

Instead of a 4-state trellis, a trellis with a higher number of states yields higher coding 
gains. Extensive computer simulations by Ungerboeck indicate that with 8, 16, 32, 64, 
128, and 256 states, coding gains in the range of 3.6—5.75 dB can be achieved. The trellis 
diagram for an 8-state trellis is shown in Figure 13.38. 


Decoding of Trellis-Coded Modulation Codes. The decoding of trellis-coded 
modulation is performed in two steps. Because each transition in the trellis corresponds 
to a partition of the signal set, and each partition generally corresponds to a number of 
signal points, the first step is to find the most likely signal point in each partition. This is 
accomplished by finding the point in each partition that is closest in Euclidean distance 


756 Coding for Reliable Communications Chapter 13 


to the received point. This first step in decoding of a trellis-coded modulation scheme is 
called subset decoding. After this step, corresponding to each transition in the trellis, there 
exists only one point (the most likely one), and only one Euclidean distance (the distance 
between the received point and this most likely point). The second step of the decoding 
procedure is to use this Euclidean distance to find a path through the trellis whose total 
Euclidean distance from the received sequence is minimum. This is done by applying the 
Viterbi algorithm. 


13.8 PRACTICAL APPLICATIONS OF CODING 


In the previous sections, we have seen that coding can be employed to improve the effective 
SNR and, thus, enhance the performance of the digital communication system. Block and 
convolutional codes and combinations of them in the form of concatenated and turbo codes 
as discussed earlier, have been applied to communication situations where bandwidth is 
not a major concern; thus, some bandwidth expansion due to coding is allowed. On the 
other hand, in cases where bandwidth is a major concern, as in digital communication over 
telephone channels, coded modulation can be employed. By using coding, the performance 
of practical digital communication systems has improved up to 9 dB, depending on the 
application and the type of the code employed. In this section, we discuss applications of 
coding to two digital communication cases. These include deep-space communications, 
and telephone-line modems. 


13.8.1 Coding for Deep-Space Communications 


Deep-space communication channels are characterized by very low SNR’s and practically 
no bandwidth limitations. The transmitter power is usually obtained from onboard solar 
cells and, therefore, is typically limited to 20-30 watts. The physical dimensions of the 
transmitting antenna is also quite limited and, therefore, its gain is also limited. The enor- 
mous distance between the transmitter and the receiver and lack of repeaters results in a 
very low SNR at the receiver. The channel noise can be characterized by a white Gaus- 
sian random process. These channels are very well modeled as AWGN channels. Because 
bandwidth is not a major concern on these channels, both block and convolutional codes 
can be applied. 

In the Viking orbiters and landers mission to Mars, a (32, 6) block code (Reed- 
Muller code) was employed. It provided a coding gain of approximately 4 dB compared 
to an uncoded PSK system at an error rate of 1076. Later, in the Voyager space mission to 
outer planets (Mars, Jupiter, and Saturn), convolutional codes with Viterbi decoding were 
employed. Two codes that were designed at the Jet Propulsion Laboratory (JPL) for that 
mission were a (2, 1) convolutional code with a constraint length of L = 7 with 


g=[l 10110 1) 
g= [0 00111 1J 
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and a (3, 1) convolutional code with L = 7 and 


g=[0 101 10 1, 
g=0 001111, 
g= 0 010111 


The first code has a free distance of dfree = 10, and the second code has dfree = 15. Both 
codes were decoded using a soft-decision Viterbi algorithm, in which the channel output 
was quantized to Q = 8 levels. The first code provides a coding gain of 5.1 dB at an error 
rate of 1075. The second code provides a gain of 5.7 dB. Both operate 4.5 dB from the 
theoretical limit predicted by Shannon. 

In subsequent missions of the Voyager to Uranus, in 1986, the (2, 1) convolutional 
code with L = 7 was used as an inner code in a concatenated coding scheme where a 
(255, 223) Reed-Solomon code served as the outer code. Viterbi decoding followed by a 
Reed-Solomon decoder at the Earth terminal provided a total coding gain of 8 dB at an 
error rate of 1076. This system operated at a data rate of approximately 30 kbits/sec. 

Other decoding algorithms for convolutional codes have also been applied to certain 
deep-space communication projects. For NASA’s Pioneer 9 mission, a (2, 1) convolutional 
code with a constraint length of L = 21 was designed with generator sequences (in octal 
representation) 


g.= [400000 0, 
g=7 15473 7. 


The decoder employed Fano’s algorithm with a soft-decision decoding scheme and eight 
levels of output quantization. Pioneers 10, 11, and 12 and Helios A and B German solar 
orbiter missions employed a (2, 1) convolutional code with a constraint length of L = 32. 
The generator sequences for this code (in octal representation) are 


gi=7 3 3 53 3 6767 2), 
g§=[15 3 3 5 3 3 6 7 6 7 2). 


This code has a free distance of diee = 23. For decoding, again, the Fano decoding algo- 
rithm with eight-level output quantization was employed. Majority logic decoding has also 
been used in a number of coding schemes designed for the INTELSAT communication 
satellites. As an example, an (8, 7) code with L = 48 designed to operate at 64 kbits/sec 
on an INTELSAT satellite was capable of improving the error rate from 1074 to 5 x 1078. 
In the Galileo space mission, a (4, 1, 14) convolutional code was used, resulting in a spec- 
tral bit-rate of ie which at €,/No of 1.75 dB, achieved an error probability of 1075. This 
code performed 2.5 dB from the Shannon limit. Using an outer (255, 223) Reed-Solomon 
code improves the coding gain by another 0.8 dB, resulting in a concatenated code oper- 
ating 1.7-dB from the Shannon limit. Turbo codes performing at a spectral-bit rate of 0.5 
and operating only 0.5-dB from the Shannon limit, compare quite favorably with all these 
systems. 
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13.8.2 Coding for Telephone-Line Modems 
Telephone-line channels are characterized by a limited bandwidth, typically between 300- 


3000 Hz, and a rather high SNR, which is usually 28-30 dB or more. Therefore, in design- 
ing coding schemes for telephone-line channels, we are faced with bandwidth limitation, 
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Figure 13.39 (a) Differential encoder, nonlinear convolutional encoder, and (b) signal constellation adopted in 
the V.32 standard. 
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This is in direct contrast to the deep-space communication channel, which is primarily 
power limited. This corresponds to the case of r >> 1 in Figure 12.17. Because bandwidth 
is limited, we have to use low dimensional signaling schemes. Since power is rather abun- 
dant, we can employ multilevel modulation schemes. As we have already seen in Section 
13.7, trellis-coded modulation is an appropriate scheme to be employed in such a case. 

Historically, the first modems on telephone channels (prior to the 1960s) 
employed frequency-shift keying with asynchronous detection and achieved bit rates in the 
range of 300-1200 bits/sec. Later, in the early 1960s, the first generation of synchronous 
modems employing 4-PSK modulation achieved bit rates of up to 2400 bits/sec. Advances 
in equalization techniques allowed for more sophisticated constellations, which resulted in 
higher bit rates. These included 8-PSK modems achieving a bit rate of 4800 bits/sec and 16- 
point QAM modems that increased the bit rate to 9600 bits/sec. In the early 1980s, modems 
with a bit rate of 14,400 bits/sec were introduced; they employed a 64-point QAM signal 
constellation. All these improvements were results of advances in equalization and signal- 
processing techniques and also improvements in the characteristics of telephone lines. 

The advent of trellis-coded modulation made it possible to design coded-modulation 
systems that improved overall system performance without requiring excess bandwidth. 
Trellis-coded modulation schemes based on variations of the original Ungerboeck’s codes 
and introduced by Wei (1984) were adopted as standard by the CCITT standard com- 
mittees. These codes are based on linear or nonlinear convolutional codes to guarantee 
invariance to 180°- or 90°-phase rotations. This is crucial in applications where differential 
encoding is employed to avoid phase ambiguities when a PLL is employed for carrier-phase 
estimation at the receiver. These codes achieve a coding gain comparable to Ungerboeck’s 
codes with the same number of states but, at the same time, provide the required phase 
invariance. In Figure 13.39, we have shown the combination of the differential encoder, 
the nonlinear convolutional encoder, and the signal mapping for the 8-state trellis-coded 
modulation system that is adopted in the CCITT V.32 standard. 


13.9 SUMMARY AND FURTHER READING 


Shannon’s noisy channel coding theorem, discussed in Chapter 12, states that reliable com- 
munication through a noisy channel is possible if and only if the transmission rate does 
not exceed the capacity of the channel. This theorem, however, is only an existence theo- 
rem not a constructive theorem. Although it shows the existence of codes at all rates less 
than capacity, it does not provide a constructive method to design such codes. Motivated 
by the promise of Shannon’s theorem a large amount of research activity in the past 60 
years has been focused on the design and implementation of coding schemes that can get 
close to Shannon’s bound but at the same time have enough structure to make their decod- 
ing tractable. In this chapter we focused on the two class of linear block codes and con- 
volutional codes and introduced their structure as well as maximum-likelihood decoding 
algorithms for them. 

We discussed two methods of decoding for block and convolutional codes; soft- and 
hard-decision decoding. The performance of soft-decision decoding is roughly 2 dB better 
than the performance of hard decision decoding. Hard decision decoding is usually the 
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decoding method used for linear block codes, and soft-decision decoding, implemented by 
using the Viterbi algorithm, is the preferred method of decoding for convolutional codes, 

Wealso introduced methods for combining two codes into more effective codes with 
tractable decoding complexity based on the decoding algorithm of the individual code 
components. Among these codes we emphasized turbo codes and the BCJR soft decision 
decoding used in an iterative fashion for decoding these codes. We also introduced LDPC 
codes and introduced two iterative decoding algorithms for them, the bit-flipping algorithm 
and the sum-product algorithm. 

We finally discussed applications of coding for bandwidth-constrained channels. We 
introduced trellis coded modulation and the notion of combined coding and modulation. 
Ungerboeck’s set partitioning rules were introduced and we showed that these rules com- 
bined with simple trellis codes can provide coding gains in the range of 3-6 dB. 

Papers by Golay (1949), Hamming (1950), Hocquenghem (1959), Bose and Ray- 
Chaudhuri (1960a, 1960b), and Reed and Solomon (1960) are landmark papers in the 
development of block codes. Convolutional codes were introduced by Elias (1955) and 
various methods for their decoding were developed by Wozencraft and Reiffen (1961), 
Fano (1963), Zigangirov (1966), Viterbi (1967), and Jelinek (1969). Trellis-coded modula- 
tion was introduced by Ungerboeck (1982) and later developed by Forney (1988a, 1988b). 
Product codes were introduced by Elias (1954) and concatenated codes were developed 
and analyzed by Forney (1966). Berrou, Glavieux, and Thitimajshima (1993) introduced 
turbo codes. Low-density parity check codes were introduced by Gallager (1962, 1963). 

The books by Berlekamp (1968), Peterson and Weldon (1972), MacWilliams and 
Sloane (1977), Lin and Costello (2005), Blahut (1983), Wicker (1995), Johannesson and 
Zigangirov (1999), Biglieri et al. (1991), and Ryan and Lin (2009) provide comprehensive 
coverage of the topics covered in this chapter. 


PROBLEMS 


13.1 In Example 13.2.1, find the minimum distance of the code. Which code word(s) is 
(are) minimum weight? 


13.2 In Example 13.2.3, verify that all code words of the original code satisfy 
cH' =0. 
13.3 By listing all code words of the (7, 4) Hamming code, verify that its minimum dis- 
tance is equal to 3. 


13.4 Find the parity check matrix and the generator matrix of a (15, 11) Hamming code 
in the systematic form. 


13.5 Show that the minimum Hamming distance of a linear block code is equal to the 
minimum number of columns of its parity check matrix that are linearly dependent. 
From this, conclude that the minimum Hamming distance of a Hamming code is 
always equal to 3. 
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13.6 


13.7 


13.8 


13.9 


13.10 


A simple repetition code of blocklength n is a simple code consisting of only two 
code words: (0, 0, ...,0) and (1, 1,..., 1). Find the parity check matrix and the 
n>S_— — — 


n n 
generator matrix of this code in the systematic form. What is the rate and the mini- 
mum distance of this code? 


The matrix 
100110 


G=|010101 
001011 


is the generator matrix of a (6, 3) linear code. This code is extended by adding an 
overall parity check bit to each code word so that the Hamming weight of each 
resulting code word is even. 


1. Find the parity check matrix of the extended code. 
2. What is the minimum distance of the extended code? 


3. Find the coding gain of the extended code. 


Compare the block error probability of an uncoded system with a system that uses 
a (15, 11) Hamming code. The transmission rate is R = 10* bps and the channel is 
AWGN with a received power of 1 Watt anda noise power spectral density of o = 
107!! Watts/Hz. The modulation scheme is binary PSK and soft-decision decoding 
is employed. Repeat the comparison when hard-decision decoding is employed. 


Generate the standard array for a (7, 4) Hamming code. Use it to decodethe received 
sequence (1, 1, 1, 0, 1, 0, 0). 


In a coded communication system, M messages 1, 2, ..., M = 2* are transmitted 
by M baseband signals xı (t), x2(t),..., xm (t), each of duration nT. The general 
form of x; (t) is given by 


n-1 
x(t) =) vt — JT), 

j=0 
where yj; (t) can be either of the two signals yı (t) or wo(t) where Yı (t) = Yr(t) = 
0 for allt ¢ [0,7]. We further assume that w(t) and y(t) have equal energy 
€ and the channel is ideal (no attenuation) with additive white Gaussian noise of 
power spectral density w, This means that the received signal is r(t) = x(t) +n(t), 
where x(t) is one of the x;(t)’s and n(t) represents the noise. 


1. With wi(t) = —yo2(t), show that N, the dimensionality of the signal space, 
satisfies N <n. 


2. Show that, in general, N < 2n. 
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3. With M = 2, show that for general y(t) and w(t), 


P(Error|x;(t) sent) < f e [ Vicen7er dr, 
RN 


where r, X;, and x2 are the vector representations of r (t), xı (t), and x2(t) in the 
N-dimensional space. 


4. Using the result of Part 3, show that for general M, 
P(Error|xm(t) sent) < Xo J . -f S(r|Xm)f 1 |xm’) dr. 
l<m!<M RN 


m! £m 


5. Show that 


X m—X y? 


Tf FOS Om dr =e m, 
RN 


and therefore, 


Xm-X pi 
P (Error|xm(t) sent) < D e m 
1<m'<M 
mm! Am 
13.11 A convolutional code is described by 

gı =[100) 

g2.=[101) 

g, =[111]. 


1. Draw the encoder corresponding to this code. 

2. Draw the state transition diagram for this code. 

3. Draw the trellis diagram for this code. 

4. Find the transfer function and the free distance of this code. 


5. Verify whether this code is catastrophic or not. 


13.12 Show that, in the trellis diagram of a convolutional code, 2* branches enter each 
state and 2* branches leave each state. 


13.13 The block diagram of a binary convolutional code is shown in Figure P-13.13. 
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Figure P-13.13 


. Draw the state diagram for the code. 
. Find T (D), the transfer function of the code. 


. What is dfree, the minimum free distance of the code? 


bh U N e 


. Assume that a message has been encoded by this code and transmitted over a 
binary symmetric channel with an error probability of p = 1075. If the received 
sequence is r = (110, 110, 110, 111, 010, 101, 101), use the Viterbi algorithm 
to find the transmitted bit sequence. 


5. Find an upper bound to the bit error probability of the code when the preceding 
binary symmetric channel is employed. Make any reasonable approximations. 


13.14 The block diagram of a (3, 1) convolutional code is shown in Figure P-13.14. 


k=1 





Figure P-13.14 


1. Draw the state diagram of the code. 
2. Find the transfer function T (D) of the code. 


3. Find the minimum free distance (dfree) of the code and show the corresponding 
path (at distance dfree from the all-zero code word) on the trellis. 
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4. Assume that four information bits (x1, x2, x3, x4), followed by two zero bits, 
have been encoded and sent via a binary symmetric channel with a crossover 
probability equal to 0.1. The received sequence is (111,111,111, 111, 
111,111). Use the Viterbi decoding algorithm to find the most likely data 
sequence. 


13.15 The convolutional code of Problem 13.11 is used for transmission over an AWGN 
channel with hard-decision decoding. The output of the demodulator-detector is 
(101001011110111...). Using the Viterbi algorithm, find the transmitted sequence. 


13.16 Repeat Problem 13.13 for a code with 


g, =[1 10) 
g2=[101) 
g3=[11 1]. 


13.17 Show the paths corresponding to all code words of weight 6 in Example 13.3.3. 


13.18 Consider the convolutional code generated by the encoder shown in Figure P-13.18. 








Figure P-13.18 


1. Find the transfer function of the code in the form T(N, D). 
2. Find diree Of the code. 


3. If the code is used on a channel using hard-decision Viterbi decoding, assuming 
the crossover probability of the channel is p = 1076, use the hard-decision 
bound to find an upper bound on the average bit error probability of the code. 


13.19 Let x; and x2 be two code words of length n with distance d; assume that these two 
code words are transmitted via a binary symmetric channel with crossover probabil- 
ity p. Let P,(d) denote the error probability in the transmission of these two code 
words. 


1. Show that Fi 


Pad) < >) V p(y le) Pi \X2), 


i=1 


where the summation is over all binary sequences y;. 
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2. From your answer to Part 1, conclude that 
d 
P,(d) < [4p(1 — p)]? . 
3. Using the result of Part 2, prove Equation (13.2.36). 
13.20 The complementary error function erfc(x) is defined by 
tos ea 
erfc(x) = —= e : 
VT Jx 
1. Express Q(x) in terms of erfc(x). 
2. Using the inequality 


erfc(./x +y) <erfc(/x)e" x20, y>0, 


prove that the bound on the average bit error probability of a convolutional 
code is 


1 ƏT(D, N) 
Pos apo (v drec Ree ) etree Rev -aN 





N=1,D=e~ Rev 
where yp = b, and we assume that soft-decision decoding is employed. 


13.21 A product code is designed using an (n1, kı) = (7, 4) systematic Hamming code 
and an (n2, k2) = (6, 2) systematic code with generator matrix 


100111 
G= 
Been 


1. Determine the minimum distance of the product code. 


as its component codes. 


2. Ifthe product code is decoded using optimal hard-decision decoding, what is the 
maximum number of errors it can correct? 


3. Consider an information sequence of length k = kik. = 8 consisting of all 
1’s. Use an n? x nı = 6 x 7 matrix similar to Figure 13.20 to find all the 
ninz2 — kik2 = 34 parity check bits for this sequence. 


4. Assume that, instead of optimal hard-decision decoding, we employ a simple 
decoding scheme in which hard-decision decoding is first applied to the rows 
and then to the columns. Show that, using this simple strategy, the code can 
correct all error patterns of weight 3 but it cannot correct all error patterns of 
weight 4. 


13.22 Prove Equation (13.5.29). 


13.23 A trellis-coded modulation system uses an 8-ary PAM signal set given by {+1, +3, 
+5, +7} and the 4-state trellis encoder shown in Figure 13.36(a). 
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1. Using the set partitioning rules, partition the signal set into four subsets. 


2. If the channel is an additive white Gaussian noise channel, and at the output of 
the matched filter the sequence (—.2, 1.1, 6, 4, —3, —4.8, 3.3) is observed, what 
is the most likely transmitted sequence? 


COMPUTER PROBLEMS 


13.1 Error Probability in Repetition Code 


The crossover probability in a binary symmetric channel is p = 0.3. Evaluate and 
plot the error probability P, for a simple repetition block code of length n, where n 
is odd, for n = 3, 5,...,41. Plot a graph of P, versus n. 


13.2 Linear Block Codes 
The generator matrix for a (10, 4) linear block code is given by 


1001110111 
1110001110 
0110110101 
1101111001 


Determine all code words and the minimum weight of the code. 


13.3 Hamming Codes 


The objective of this problem is to determine all the code words of the (15, 11) 
Hamming code. 


1. Determine the parity check matrix of the (15, 11) Hamming code, which has the 
form 


H=[P'|I;]. 
2. Determine the generator matrix G, which has the form 
G=[I;|P]. 
3. Use the generator matric to generate and list all the code words in the code. 


13.4 Performance of Hard-Decision Decoding 


The (15, 11) Hamming code is used with antipodal signaling, and hard-decision 
decoding is used at the receiver to detect the code words. The error probability for a 
single bit in any code word is given by 


[2R.% 
P =Q sa , 
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where @,/No is the SNR/bit and R, is the code rate (11/15). Compute and plot the 
(upper bound) code word error probability as a function of 6,/No. 


13.5 Soft-Decision Decoding 


The (15, 11) Hamming code is used with antipodal signaling, and soft-decision 
decoding is used at the receiver to detect the code words. Using the upper bound 


on the error probability 
2dminR-& 
Pu=M-D0| TFT] 
0 


evaluate and graph Py as a function of the SNR/bit €,/No. 


13.6 Convolutional Encoder 


Determine the output sequence of the convolutional encoder shown in Figure 
CP-13.6 when the information sequence is 


10011100110000111. 





Figure CP-13.6 


The encoder is initially in the all-zero state. 


13.7 Viterbi Decoding 


The encoder shown in Figure CP-13.7 is used to transmit a sequence of 5 information 
bits followed by two zeros to flush out the decoder. The quantized (hard-decision) 
received sequence at the input to the Viterbi decoder is 


y=01101111010001. 


1. Sketch the trellis for this convolutional code and label the branches. 


2. Determine the maximum-likelihood 5 bit information sequence at the output of 
the Viterbi decoder. 
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ee 


Figure CP-13.7 


13.8 Write a MATLAB program to generate the parity-check bits when a general binary 
sequence is encoded by the 5/7 RSCC shown in Figure 13.24. Then apply the pro- 
gram to the input sequence 


u=[0 1 1 1 0 0 100 1 100 100 1 11 l] 


and generate the corresponding parity bits. Assume that the encoder starts from the 
zero state. 


13.9 Write a MATLAB program to implement the computation of the forward recursion 
of the BCJR algorithm as given by Equation (13.5.8). 


13.10 Write a MATLAB program to implement the computation of the backward recursion 
of the BCJR algorfithm as given by Equation (13.5.11).. 


13.11 Write a MATLAB program to implement the sum-product algorithm for decoding 
LDPC codes using Equations (13.6.11)—(13.6.15) 
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In Chapters 8, 9, and 10, we described digital modulation and demodulation methods for 
transmission of information over two types of channels, namely, an additive Gaussian noise 
channel and a linear filter channel. Such channel models are appropriate for characterizing 
physical channels that are relatively static; i.e., the channel transmission characteristics are 
generally modeled as time invariant. In this chapter, we treat modulation and demodulation 
techniques that are appropriate for wireless communication channels, such as radio and 
acoustic communication channels, whose transmission characteristics are time varying. 


14.1 CHARACTERIZATION OF PHYSICAL WIRELESS CHANNELS 


Physical channels with time-varying transmission characteristics may be characterized as 
time-varying linear filters. Such linear filters are described by a time-varying impulse 
response c(T; t), where c(T; t) is the response of the channel at time ¢ due to an impulse 
applied at time t — t. Thus, t denotes the “age” (elapsed time) variable. The time-varying 
linear filter model of the channel with additive noise was previously shown in Figure 1.10. 
We cite the following examples of wireless communication channels that can be modeled 
in this manner. 


Signal Transmission via Ionospheric Propagation in the HF Band. We recall 
from our discussion in Chapter 1 that sky-wave propagation, as illustrated in Figure 1.6, 
results from transmitted signals [in the high-frequency (HF) band] being bent or refracted 
by the ionosphere, which consists of several layers of charged particles ranging in altitude 
from 30 to 250 miles above the surface of the earth. As a consequence of these ionospheric 
layers, the signal arrives at the receiver via different propagation paths at different delays. 
These signal components are called multipath components. The signal multipath compo- 
nents generally have different carrier-phase offsets and, hence, they may add destructively 
at times, resulting in a phenomenon called signal fading. Hence, signal fading is a result 
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of multipath signal propagation. To characterize such channel behavior, we adopt a time- 
varying impulse-response model. 


Mobile Cellular Transmission. In mobile cellular radio transmission between 
a base station and a mobile telephone, the signal transmitted from the base station to the 
mobile receiver is usually reflected from surrounding buildings, hills, and other obstruc- 
tions. As a consequence, we observe multiple propagation paths arriving at the receiver 
at different delays. Hence, the received signal has characteristics similar to those for iono- 
spheric propagation. The same is true of transmission from the mobile telephone to the base 
station. Moreover, the speed that the mobile (automobile, train, etc.) is traveling results in 
frequency offsets, called Doppler shifts, of the various frequency components (see Problem 
14.1) of the signal. 


Line-of-sight Microwave Radio Transmission. In line-of-sight (LOS) radio 
transmission of signals, the transmitting and receiving antennas are generally mounted 
on high towers, in order to avoid obstructions, such as buildings and hills, in the path of 
signal propagation. However, when there are tall obstructions or hilly terrain in the path 
of propagation, it is likely that signals will be reflected from the ground to the receiving 
antenna as illustrated in Figure 14.1. This is especially a problem under severe weather 
conditions. In this case, there is a received signal component that arrives via the direct path 
and an ensemble of secondary paths that are reflected from the ground terrain. The latter 
arrive at the receiver with various delays and constitute multipath propagation. Relatively 
narrow-beamwidth antennas are employed in microwave LOS transmission to reduce the 
occurrence of secondary reflections. Nevertheless, some secondary signal reflections are 
frequently observed in practice. Such secondary signal reflections generally vary with time, 
so the channel may be characterized by a time-varying impulse response. 


Airplane-to-Airplane Radio Communications. In radio communications bet- 
ween two aircraft, it is possible for secondary signal components to be received from 
ground reflections, as illustrated in Figure 14.2. This is especially the case when omni- 
directional antennas are employed in the communication system. The ensemble of ground- 
reflected signal components generally arrive at the receiver with different delays and 
different attenuations. In addition, the motions of the aircraft result in Doppler frequency 
offsets in the various signal components. In many respects, this situation is similar to that 
in mobile cellular communications. 


Direct path 






asy 


sec% 





Figure 14.1 Illustration of 
multipath propagation in LOS 
“+ microwave transmission. 
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Direct path 
Secondary path 
Figure 14.2 Illustration 
of multipath propagation 
Ground 7 in airplane-to-airplane 


communications. 


Underwater Acoustic Signal Transmission. A shallow-water acoustic channel 
is generally characterized as a multipath channel due to acoustic signal reflections from the 
surface and the bottom of the sea. Because of wave motion, the signal multipath compo- 
nents undergo time-varying propagation delays which result in signal fading. In addition, 
there is frequency-dependent attenuation, which increases proportionally as the square of 
the signal frequency. 

The channels briefly described above may be generally characterized as linear sys- 
tems with time-varying impulse responses. Since it is generally difficult, if not impossi- 
ble, to characterize the microscopic effects of signal transmission on channels as the ones 
described above in a deterministic fashion, it is logical to adopt a statistical characteriza- 
tion. Such an approach is described below. 


14.2 CHANNEL MODELS FOR TIME-VARIANT MULTIPATH CHANNELS 


As we have observed, there are basically two distinct characteristics of the types of chan- 
nels described above. One characteristic is that the transmitted signal arrives at the receiver 
via multiple propagation paths, each of which has an associated time delay. For example, if 
we transmit an extremely short pulse, the channel response due to multiple scatterers (such 
as ionized particles in the ionosphere) might appear as shown in Figure 14.3. Because the 
received signal is spread in time due to the multiple scatterers at different delays, we say 
that the channel is time dispersive. 

A second characteristic of the types of channels described above is concerned with 
the time variations in the structure of the medium. As a result of such time variations, the 
response of the channel to any signal transmitted through it will change with time. Hence, 
if we repeat the short pulse transmission experiment over and over, we would observe 
changes in the received signal, which are due to physical changes in the medium. Such 
changes include variations in the relative delays of signals from the multiple scatterers. 
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Transmitted signal Received signal 


t=t, 0 t 


TX IX Figure 14.3 Illustration 
a ar of time-variant channel-response 
0 t 


Eh characteristics. 
Hence, the received signal might appear as illustrated in Figure 14.3. In other words, the 
impulse response of the channel is varying with time. In general, the time variations in 
the received signal appear to be unpredictable to the user of the channel. This leads us to 
characterize the time-variant multipath channel statistically. 

To obtain a statistical description of the channel, let us consider the transmission of 
an unmodulated carrier 

s(t) = A cos 27 fet. (14.2.1) 


The received signal in the absence of noise may be expressed as 


x(t) = A œn (t) cos[2m felt — tr] 


= A Re [Ee (Eei Semn O a (14.2.2) 
n 


where œ, (t) is the time-variant attenuation factor associated with the nth propagation path 
and T,,(¢) is the corresponding propagation delay. The complex-valued signal 


ODA 
=Y an(t) Fen (14.2.3) 


represents the response of the channel to the complex exponential exp(j21 fet). We note 
that, although the input to the channel is a monochromatic signal; i.e., a signal at a single 
frequency, the output of the channel consists of a signal that contains many different fre- 
quency components. These new components are generated as a result of the time variations 
in the channel response. The r.m.s. (root-mean-square) spectral width of c(t) is called the 
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Doppler frequency spread of the channel and is denoted as B4. This quantity is a measure 
of how rapidly the signal c(t) is changing with time. If c(t) changes slowly, the Doppler 
frequency spread is relatively small, while if c(t) changes rapidly, the Doppler frequency 
spread is large. 

We may view the received complex-valued signal c(t) in Equation (14.2.3) as the 
sum of a number of vectors (phasors) each of which has a time-variant amplitude a, (t) 
and phase @,(t). In general, it takes large dynamic changes in the physical medium to 
cause a large change in {a,(t)}. On the other hand, the phases {¢,(t)} will change by 
2x radians whenever {t,,(t)} change by 1/f,. But 1/fe is a small number and, hence, the 
phases {¢,(t)} change by 2x or more radians with relatively small changes of the medium 
characteristics. We also expect the delays {t,(t)} associated with the different signal paths 
to change at different rates and in as unpredictable (random) manner. This implies that 
the complex-valued signal c(t) in Equation (14.2.3) can be modeled as a random process. 
When there are a large number of signal propagation paths, the central limit theorem can 
be applied. Thus, c(t) can be modeled as a complex-valued Gaussian random process. 

The multipath propagation model for the channel, embodied in the received signal 
x(t) or, equivalently, c(t) given by Equation (14.2.3), results in signal fading. The fading 
phenomenon is primarily a result of the time-variant phase factors {¢,(t)}. At times, the 
complex-valued vectors in c(t) add destructively to reduce the power level of the received 
signal. At other times, the vectors in c(t) add constructively and, thus, produce a large sig- 
nal value. The amplitude variations in the received signal due to the time-variant multipath 
propagation in the channel are called signal fading. 


Multipath Spread and Coherence Bandwidth. The time span between the first- 
and the last-arriving multipath components in a transmitted signal is called the multipath 
(time) spread of the channel. We denote this channel parameter as Tn. A related parameter 
is the reciprocal of the multipath spread, which provides a measure of the bandwidth over 
which frequency components of the transmitted signal will be affected similarly by the 
channel. Thus, we define the channel parameter 

Bo = : 14.2.4 

fe ia Ta (14.2. ) 
andcall it the coherence bandwidth of the channel. For example, all frequency components 
of a transmitted signal that fall within the coherence bandwidth Be» will fade simultane- 
ously. If the transmitted signal has a bandwidth W < Bo, the channel is called frequency 
nonselective. Thus, at any instant in time, all frequency components of the transmitted 
signal fade simultaneously. On the other hand, if the transmitted signal has a bandwidth 
W > Bc, the frequencies in the signal separated by an amount greater than Bæ will be 
affected differently by the channel. Hence, at any instant, some frequency components in 
the transmitted signal may fade, whereas other frequency components may not. In such a 
case, the channel is said to be frequency Selective. 


Doppler Spread and Coherence Time. Another channel parameter is the recip- 
rocal of the Doppler spread, which measures the time interval over which the channel 
response will change very little. Thus, we define the parameter 
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1 


Ta = — 
amp 


(14.2.5) 
and call it the coherence time of the channel. For example, a signal that is transmitted at 
two different time instants, separated in time by an amount less that Tot, will be affected 
similarly by the channel. Hence, if the signal that is transmitted in the first time instant is 
highly attenuated by the channel, the signal transmitted in the second time instant will also 
be highly attenuated. On the other hand, if the time interval between the signal transmis- 
sions is much greater than the coherence time Tot, the channel will likely affect the two 
signal transmissions differently. 


Example 14.2.1 
A shortwave ionospheric radio channel is characterized by a multipath spread of T,, = 5 ms 
and a Doppler spread of By = 0.1 Hz. Determine the coherence bandwidth and the coherence 
time of the channel. 


Solution The coherence bandwidth of the channel is 


1 
Bo = T, = 200 Hz. 


The coherence time of the channel is 


1 
Ta = — = 10 sec. a 
d 


The Channel Spread Factor. The product Tm Ba is usually called the channel 
spread factor. If Tm Ba < 1, the channel is called underspread and if Tm Ba > 1, the channel 
is said to be overspread. The spread factor usually provides some indication on whether 
or not phase-coherent demodulation is possible at the receiver. In general, if the channel 
is overspread, due either to a large multipath spread or a large Doppler spread or both, the 
estimation of the carrier phase is extremely difficult because of the rapid time variations 
(Tot K Tm) in the channel that occur in the time interval Tm. On the other hand, if the 
channel is underspread, the channel-time variation is slow relative to the multipath spread 
(Ta >> Tm) and, hence, the carrier phase of the received signal can be estimated with good 
precision. Fortunately, most physical time-varying channels encountered in practice are 
underspread. Table 14.1 lists the values of these channel parameters for several multipath 
channels. 


14.2.1 Frequency Nonselective Fading Channel ` 
Let us consider the transmission of a signal s(t) over a linear time-varying channel with 
the frequency response C( f; t). If S( f) denotes the spectrum of the transmitted signal, the 


received signal is given in the frequency domain as C(f; t)S(f) = R(f) and in the time 
domain as the inverse Fourier transform of R( f); i.e., 


r(t)= / CCF; DSe" df. (14.2.6) 
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TABLE 14.1 MULTIPATH SPREAD, DOPPLER SPREAD, AND SPREAD FACTOR FOR SEVERAL TIME-VARIANT 
MULTIPATH CHANNELS 








Type of Channel Multipath Duration (sec) Doppler Spread (Hz) Spread Factor 
Shortwave ionospheric 10-3 — 107? 107! —1 1074 — 1072 
propagation (HF) 
Ionospheric propagation under 10-3 — 107? 10 — 100 107? — 1 
disturbed auroral conditions 
Ionospheric forward 1074 10 10-3 
scatter (VHF) 
Tropospheric scatter 10-6 10 10-5 
Mobile cellular (UHF) 1075 100 10-3 
Wireless indoor LANs 10-7 10? 1075 
at 5 GHz 


Now, suppose that the bandwidth W of the transmitted signal s(t) satisfies the con- 
dition W < Be, so that the channel is frequency nonselective. This condition implies 
that, over the bandwidth range (—W, W) occupied by the transmitted signal, the frequency 
response is constant in the frequency variable f and may be denoted as 


Cf; tij = CO; t) = c(t). (14.2.7) 


Therefore, Equation (14.2.6) simplifies to 


r(t) = c(t) / S(fye?™' df 


= c(t)s(t). (14.2.8) 


Consequently, in a frequency nonselective channel, the channel distorts the transmitted 
signal in a multiplicative manner, as illustrated in Figure 14.4. 

Another view of the frequency nonselective channel is obtained when the signal s(t) 
with bandwidth W has a time duration T ~ 1/W; because W < By = 1/Tm, it fol- 
lows that T >> Tm. In this case, the time dispersion due to the channel multipath is much 
smaller than the time duration T of the transmitted signal. Hence, the channel multipath 
components, whose amplitudes and phases are given by Equation (14.2.3), are not resolv- 
able. Thus, they are seen only as a single disturbance that multiplies the transmitted signal 
S(t) as shown in Figure 14.4 and causes fading. 


Slow Fading Frequency Nonselective Channel. The frequency nonselective 
channel model illustrated in Figure 14.4 applies when the transmitted signal bandwidth W 


Transmitted signal 


s(t) 


Received signal 





Figure 14.4 Model of a frequency 
nonselective time-varying channel 
c(t) AWGN with AWGN. 
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satisfies the condition W < Be. A further simplification occurs when the coherence time 
Tz of the channel is much larger than the signal time duration T (i.e., Ta >> T). In this 
case, the channel characteristic c(t) may be treated as a constant over the signal duration 
T and may be expressed as 


c(t) =a (eO ,0<t<T 


— gei? OZET’ (14.2.9) 


We call such a channel a slowly fading, frequency nonselective channel. 


Example 14.2.2 


Consider the radio channel in Example 14.2.1. The signal s(t) transmitted over the channel 
has a bandwidth W = 50 Hz and a time duration of T ~ 1/W = 20 msec. Is this a frequency 
nonselective channel? Is the channel slowly fading? 


Solution Because the bandwidth W < Bæ = 200 Hz, the channel is frequency nonselec- 
tive. Furthermore, because T < Te = 10 sec, the channel is also slowly fading. C] 


Frequency Nonselective Rayleigh Fading Channel. In the channel model illus- 
trated in Figure 14.4, the complex-valued channel gain may be expressed as 


c(t) = c (t) + jei(t) 
=a(tel?, (14.2.10) 


a(t) = ,/c2(t) + c? (t), 


c(t) 
cr(t) i 


When c(t), as given by Equation (14.2.3), consists of many nonresolvable multipath com- 
ponents having random amplitude and uniformly distributed phases, the two components 
c,(t) and c;(t) are usually modeled as zero-mean, Gaussian random processes with zero 
cross correlation. Therefore, œ (t) is characterized statistically by the Rayleigh probability 
distribution and @(¢) is uniformly distributed over the interval (0, 2x). As a consequence, 
the channel is called a Rayleigh fading channel. The Rayleigh fading signal amplitude is 
described by the probability density function (PDF) 


where 





¢(t) = arctan (14.2.11) 


f(a) = ae a>0 (14.2.12) 


and f (œ) = Ofora < 0. The parameter o? = E(c?) = E(c?). 


Example 14.2.3 


Use the method described in Computer Problem 5.2 to generate a sequence of 20,000 statis- 
tically independent and identically distributed Rayleigh random variables. Plot the histogram 
for the 20,000 symbols and compare it with the corresponding Rayleigh PDF. 
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Figure 14.5 Plots for Example 14.2.3. 


Solution We may use the equation 


1 
Qi = form 
l — ui 


to generate the 20,000 samples from a Rayleigh distribution, where the parameter A is gen- 
erated from a uniform distribution in the interval (0, 1) and o? may be arbitrary selected as 
unity. Then the actual Rayleigh PDF is given by Equation (14.2.12). Figure 14.5 illustrates the 
histogram and the comparison with the actual Rayleigh PDF. a 





14.2.2 Frequency Selective Fading Channel 


The frequency nonselective, slowly fading channel model described above applies to many 
physical radio channels used for digital communications when the bandwidth W of the 
transmitted signal satisfies the condition W < Be». However, there are communication 
systems in which the transmitted signal bandwidth W > Be», so that the channel is fre- 
quency selective. In such a case, a more complex channel model must be employed. 


Tapped Delay Line Channel Model. A general model for a time-variant mul- 
tipath channel is illustrated in Figure 14.6. The channel model consists of a tapped delay 
line with uniformly spaced taps. The tap spacing between adjacent taps is 1/W, where W 
is the bandwidth of the signal transmitted through the channel. Hence, 1/W is the time 


778 Data Transmission in Fading Multipath Channels Chapter 14 


Tn 
Input 








Channel 
output 


Additive Figure 14.6 Model for 
noise time-variant multipath channel. 


resolution that can be achieved by transmitting a signal of bandwidth W. The tap coeffi- 
cients, denoted as {c,(t) = an (thet), are usually modeled as complex-valued, Gaussian 
random processes that are mutually uncorrelated. The length of the delay line corresponds 
to the amount of time dispersion in the multipath channel, which is the multipath spread. 
The multipath spread may be expressed as Tm = L/W, where L represents the maximum 
number of possible multipath signal components. 


Example 14.2.4 
Determine an appropriate channel model for two-path ionospheric propagation, where the 


relative time delay between the two received signal paths is 1 msec and the transmitted signal 
bandwidth W is 10 kHz. 


Solution A 10-kHz signal can provide a time resolution of 1/W = 0.1 msec. Because the 
relative time delay between the two received signal paths is 1 msec, the tapped delay line 
model consists of 10 taps. In this case, only the first tap and the last tap have nonzero, time- 
varying coefficients, denoted as c;(t) and c(t), as shown in Figure 14.7. Because c(t) and 
C2(t) represent the signal response of a large number of ionized particles from two differ- 
ent regions of the ionosphere, c; (t) and c2(t) are modeled as complex-valued, uncorrelated 
Gaussian random processes. The rate of variation of the tap coefficients determines the value 
of the Doppler spread for each path. a 


14.2.3 Models for the Doppler Power Spectrum 


When the tap coefficients {c;,(t)} in the tapped delay line channel model are characterized 
as Gaussian random processes, they may be easily generated by passing complex-valued 
white Gaussian noise through a lowpass filter whose bandwidth is selected to match the 
Doppler spread characteristics of the channel. 
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S Figure 14.7 Figure 
Noise for Example 14.2.4. 


Example 14.2.5 


Let us use a simple digital IIR (infinite duration impulse response) filter that is excited by 

complex-valued white Gaussian noise to generate a sampled version of a channel tap weight 

coefficients. The lowpass IIR filter with two identical poles is described by the z-transform 
a-p}? (l= py 


H(z) = —— n = — 14.2.13 
= (l — pz-!)}? 1 — 2pz-! + p2z-? i 4 


and the corresponding difference equation 
c[n] = 2pe[n — 1] — p?c[n — 2) + A — p} win], (14.2.14) 


where w[n] = w,[n] + jw;[n] is the additive white Gaussian noise (AWGN) input, c[7] is the 
output, and 0 < p < 1 is the pole position. The position of the pole controls the bandwidth 
of the filter and, hence, the rate of variations of c[n]. When p is close to the unit circle, the 
filter bandwidth is narrow, whereas when p is close to zero, the bandwidth is wide. Generate 
1000 samples of c[n] = c,[n] + jci[n] when p = 0.9 and p = 0.99, and plot c,[n], ciin] and 
|c[n]| for each value of p. Also, compute and plot the power spectrum and the autocorrelation 
function of c, [n] for the two values of p by using the formulas given in Computer Problem 5.3. 


Solution Figure 14.8 illustrates time variation in the values of the tap weight coefficients for 
p = 0.9 and p = 0.99, respectively. Figure 14.9 illustrates the autocorrelation functions and 
power spectra of the corresponding values of p. Note the relationship between the rate of time 
variations in the coefficients with the bandwidth of the power spectrum, which is identical to 


the bandwidth of the digital filter. 


Jakes’ Model for the Doppler Power Spectrum. A widely used model for the 
Doppler power spectrum of a mobile radio channel is the so-called Jakes’ model. In this 


model, the autocorrelation of the time-varying transfer function C(f;; t) is given as 


E[C*( fe; t)C (fer t + At)] = I(T fm At), (14.2.15) 
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Figure 14.8 Plots of c,[n], c:[n], and |c[n]| for p = 0.99 (left) and p = 0.9 (right) for Example 14.2.5. Note 
that plots have different vertical scalings. 
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Figure 14.9 Plots of estimated Rx[n] and S(f) for p = 0.9 (top) and p = 0.99 (bottom) for Example 14.2.5. 


where Jo(-) is the zero-order Bessel function of the first kind, which was introduced pre- 
viously in Chapter 3 for characterizing the spectrum of an angle-modulated signal, and 
Ím = vf,/c is the maximum Doppler frequency, where v is the vehicle speed in meters per 
second (m/sec), fs is the carrier frequency, and c is the speed of light (3 x 108 m/sec). The 
Fourier transform of the autocorrelation function in Equation (14.2.15) yields the Doppler 
power spectrum. That is, 
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Figure 14.10 Doppler power spectrum obtained from Jakes’ model. 


Sf) = / i Dn fin Ate IA d At 


—00 


1 P (14.2.16) 
= [nva ds 
0 fl > fm 
The graph of Se( f) is shown in Figure 14.10. 
Example 14.2.6 


Determine the Doppler power spectrum of the fading process experienced by a mobile tele- 
phone user in an automobile traveling at a speed of 100 km/hour. The carrier frequency for the 
mobile telephone system is 1 GHz. 


Solution At the speed of 100 km/hour, the vehicle speed is v = 28 m/sec. Therefore, the 
maximum Doppler frequency is 


Ín = vfele 
=28 x 10°/3 x 108 
= 93 Hz 


and the Doppler power spectrum is 
1 
5(f) = —_—___., < fn 
Bry 1 — (f/93)? lsh 


and zero for | f| > fm- a 


14.2.4 Propagation Models for Mobile Radio Channels 


In the link budget calculations that are described in Section 14.5, we characterize the path 
loss of radio waves propagating through free space as being inversely proportional to d?, 
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where d is the distance between the transmitter and the receiver. However, in a mobile 
radio channel, propagation is generally neither free space nor LOS. The mean path loss 
encountered in mobile radio channels may be characterized as being inversely proportional 
to dP, where 2 < p < 4, with d* being a worst-case model. Consequently, the path loss is 
usually much more severe compared to that of free space. 

There are a number of factors affecting the path loss in mobileradio communications. 
Among these factors are base-station antenna height, mobile antenna height, operating 
frequency, atmospheric conditions, and presence or absence of buildings and trees. Various 
mean path loss models have been developed that incorporate such factors. For example, a 
model for a large city in an urban area is the Hata model, in which the mean path loss is 
expressed as 


Loss in dB = 69.55 + 26.16 logig f — 13.82 logig h: — a (h+) 
+ (44.9 — 6.55log,9 hr) logigd, (14.2.17) 


where f is the operating frequency in MHz (150 < f < 1500), h, is the transmitter antenna 
height in meters (30 < h, < 200), h, is the receiver antenna height in meters (1 < h, < 10), 
d is the distance between transmitter and receiver in kilometers (1 < d < 20), and 


alh) = 3.2(logi9 11.75h,)? — 4.97, f > 400MHz. (14.2.18) 


In mobile radio communications we often encounter the effect of shadowing of the 
signal due to large obstructions, such as large buildings, trees, and hilly terrain between the 
transmitter and the receiver. Shadowing is usually modeled as a multiplicative and, gen- 
erally, slowly time-varying random process; i.e., the received signal may be characterized 
mathematically as 


r(t) = Aog(t)s(t), (14.2.19) 


where Apo represents the mean path loss, s(t) is the transmitted signal, and g(t) is a random 
process that represents the shadowing effect. At any time instant, the shadowing process is 
modeled statistically as lognormally distributed. The PDF for the lognormal distribution is 





1 g-(ng—p)*/20? >0 
f(g) = | Vance © we”) i (14.2.20) 
0, (g < 0) 


If we define a new random variable X as X = In g, then 


1 
f(x) = eee OW? Loo < x < oo. (14.2.21) 


~y 2m0? 


The random variable X is proportional to the path loss measured in dB, p is the mean path 
loss in dB, and ø is the standard deviation of the path loss in dB. For a typical cellular 
environment, ø is in the range of 5-12 dB. 
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14.3 PERFORMANCE OF BINARY MODULATION IN RAYLEIGH FADING CHANNELS 


In this section, we determine the probability of error at the receiver of a binary digital com- 
munication system that transmits information through a Rayleigh fading channel. Although 
we focus our treatment on binary modulation, the basic concepts carry over to M-ary mod- 
ulation without exception. 


14.3.1 Probability of Error in Frequency Nonselective Channels 


The signal bandwidth W is assumed to be much smallerthan the coherence bandwidth Be 
of the channel. Since the multipath components are not resolvable, the channel is frequency 
nonselective and, hence, the channel impulse response is represented as 


c(t; t) = a(t)d(t — to(t)), (14.3.1) 


where a(t) has a Rayleigh distribution at any instant in time. 

We assume that the time variations of a(t) and To(t) are very slow compared to the 
symbol interval, so that within the time interval 0 < t < T, the channel impulse response 
is constant; i.e., 


c(t; t) = c(t) = ad(t — To), (14.3.2) 
where the amplitude œ is Rayleigh distributed; i.e., 


2 2 
a e~a 20" w> 


fa) = (14.3.3) 


‘ otherwise 


Now, suppose that binary antipodal signals, e.g., binary PSK, are used to transmit the 
information through the channel. Hence, the two possible signals are 


Un(t) = Ni “ cos(2nf.t +mn)+n(t), m=0,1. (14.3.4) 


The received signal is the interval 0 < t < T is 


r(t) = CEN = cos(2n fet +mn+¢)+n(t), (14.3.5) 


where ¢ is the carrier-phase offset. Let us assume that œ is known to the demodulator, 
which cross correlates r(t) with 


2 
Ye) = r cos(2n fet +), O<t<T. (14.3.6) 
Hence, the input to the detector at the sampling instant is 


r=a/€,cosman+n, m=0,1. (14.3.7) 
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For a fixed value of a, the probability of error is the familiar form 


[> ey? 
Py(a) = of ~~). (14.3.8) 


We view P,(œ) as a conditional probability of error for a given value of the channel atten- 
uation œ. To determine the probability of error averaged over all possible values of a, we 
compute the integral 


P, = f P (œ) f (œ) da, (14.3.9) 
0 


where f (œ) is the Rayleigh PDF given by Equation (14.3.3). This integral has the simple 
closed form expression 





1 Pb 
Pp=-—|1- ; 14.3.10 
PTD | zA : ) 


where, by definition, 
= Eb 2 
Pp = — E (a°). (14.3.11) 
No 
Hence, ð, is the average received SNR/bit and E (a?) = 20°. 


If the binary signals are orthogonal, as in orthogonal FSK, where the two possible 
transmitted signals are given as 


um(t) = [E fpl: + =) | , m=0,1, (14.3.12) 


the received signal is 


r) = a eas [on (s: + =) t+ | +n(t). (14.3.13) 


In this case, the received signal is cross correlated with the two signals 


y(t) = — cos(27 fet + $), 


(14.3.14) 
2 1 
y(t) = J2 cosf2 (4. + F) t+ J : 
If m = 0, for example, the two correlator outputs are 
ri = a 6b +71, 
i pe (14.3.15) 


rn = mMm, 


where nı and nz are the additive noise components at the outputs of the two correlators. 
Hence, the probability of error is simply the probability that r2 > rı. Since the signals are 
orthogonal, the probability of error for a fixed value of œ has the familiar form 
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| oy? 
P,(œ) = of m) (14.3.16) 


As in the case of antipodal signals, the average probability of error over all values of œ is 
determined by evaluating the integral in Equation (14.3.9). Thus, we obtain 


1 Pb 
paati ' 14.3.17 
b | A] ( ) 


where p, is the average SNR/bit defined by Equation (14.3.11). 

Figure 14.11 illustrates the average probability of error for binary antipodal and 
orthogonal signals. The striking aspects of these graphs is the slow decay of the proba- 
bility of error as a function of SNR. In fact, for large values of pp, the probability of error 
for binary signals is 
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1 
P, ~ —, antipodal signals 
4Pb 
1 (14.3.18) 
P, © —., orthogonal signals 
bp 2 8 


Hence the probability of error in both cases decreases only inversely as the SNR. This is 
in contrast to the exponential decrease in the case of the AWGN channel. We also note 
that the difference in SNR between antipodal signals (binary PSK) and orthogonal signals 
(binary FSK) is 3 dB. 

Two other types of signal modulation are DPSK and noncoherent FSK. For com- 
pleteness, we state that the average probability of error for these signals (see Problem 
14.5) is 





1 
P, = ————~. _ DPSK, 14.3.19 
” 21+ pp) i } 
1 
P, = —, noncoherent FSK. (14.3.20) 
2+ Pp 


14.3.2 Performance Improvement Through Signal Diversity 


The basic problem in digital communication through a fading channel is that a large num- 
ber of errors occur when the channel attenuation is large; i.e., when the channel is in a 
deep fade. If we can supply to the receiver two or more replicas of the same information 
signal transmitted through independently fading channels, the probability that all the signal 
components will fade simultaneously is reduced considerably. If p is the probability that 
any one signal will fade below some critical value, than p? is the probability that all D 
independently fading replicas of the same signal will fade below the critical value. There 
are several ways that we can provide the receiver with D independently fading replicas of 
the same information-bearing signal. 

One method for achieving D independently fading versions of the same information- 
bearing signal is to transmit the same information on D FDM carrier frequencies, where 
the separation between successive carriers equals or exceeds the coherence bandwidth Beb 
of the channel. This method is called frequency diversity. 

A second method for achieving D independently fading versions of the same 
information-bearing signal is to transmit the same information in D different time slots, 
where the time separation between successive time slots equals or exceeds the coherence 
time Tg of the channel. This method is called time diversity. 

Another commonly used method for achieving diversity is via use of multiple receiv- 
ing antennas, but only one transmitting antenna. The receiving antennas must be spaced 
sufficiently far apart so that the multipath components in the signal have significantly dif- 
ferent propagation paths, as illustrated in Figure 14.12. Usually, a separation of a few wave- 
lengths is required between a pair of receiving antennas in order to obtain signals that fade 
independently. 

Other diversity transmission and reception techniques that are used in practice are 
angle-of-arrival diversity and polarization diversity. 
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Figure 14.12 Illustration of diversity 
reception using two receiving antennas. 





Given that the information is transmitted to the receiver via D independently fading 
channels, there are several ways that the receiver may extract the transmitted information 
from the received signal. The simplest method is for the receiver to monitor the received 
power level in the D received signals and to select for demodulation and detection the 
strongest signal. In general, this approach results in frequent switching from one signal 
to another. A slight modification that leads to a simpler implementation is to use a signal 
for demodulation and detection as long as the received power level in that signal is above 
a preset threshold. When the signal falls below the threshold, a switch is made to the 
channel which has the largest received power level. This method of signal selection is 
called selection diversity. 

For better performance, we may use one of several more complex methods for com- 
bining the independently fading received signals as illustrated in Figure 14.13. One that is 
appropriate for coherent demodulation and detection requires that the receiver estimate and 
correct for the different phase offsets on each of the D received signals after demodulation. 
Then, the phase-corrected signals at the outputs of the D demodulators are summed and fed 
to the detector. This type of signal combining is called equal-gain combining. If, in addi- 
tion, the received signal power level is estimated for each of the D received signals, and 
the phase-corrected demodulator outputs are weighted in direct proportion of the received 
signal strength (square root of power level) and then fed to the detector, the combiner is 
called a maximal-ratio combiner. On the other hand, if orthogonal signals are used for 
transmitting the information through D independently fading channels, the receiver may 
employ noncoherent demodulation. In such a case the outputs from the D demodulators 
may be squared, summed, and then fed to detector. This combiner is called a square-law 
combiner. 

All these types of combining methods lead to performance characteristics that result 
in a probability of error which behaves as K p/” where Kp is a constant that depends on 
D, and ĝ is the average SNR/diversity channel. Thus, we achieve an exponential decrease 
in the error probability. Without providing a detailed derivation, we simply state that for 
antipodal signals with maximal ratio combining, the probability of error has the gen- 
eral form 


p>, (14.3.21) 
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where Kp is defined as 
_ @D-—1)! 


D= DD- D! (14.3.22) 


For binary orthogonal signals with square-law combining, the probability of error has the 
asymptotic form 


K 
Pp & zD p>. (14.3.23) 


Finally, for binary DPSK with equal gain combining, the probability of error has the 
asymptotic form 
Kp 


x OPP’ pl. (14.3.24) 


Pp 

These error probabilities are plotted in Figure 14.14 for D = 1, 2, 4 as a function of 

the SNR/bit p, = Dp. It is apparent that a large reduction in SNR/bit is achieved in having 

D = 2 (dual diversity) compared to no diversity. A further reduction in SNR is achieved 

by increasing the order of diversity to D = 4, although the additional gain from D = 2 

to D = 4 is smaller than going from D = 1 to D = 2. Beyond D = 4, the additional 
reduction in SNR is significantly smaller. 
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Figure 14.14 Performance of binary signals with diversity. 


This analysis of the performance of binary modulation focused on Rayleigh fading 
signal statistics. In general, the Rayleigh distribution is suitable for modeling the signal 
fading that occurs in ionospheric propagation and mobile cellular systems. However, there 
are other statistical models that have been used for a variety of fading multipath channels. 
The most common of these are the Nakagami distribution and the Rician distribution. 


Benefits of Coding in Achieving Signal Diversity. These performance results 
illustrate that efficient use of transmitter power in a Rayleigh fading channel can be 
achieved by using some form of diversity to provide the receiver with several indepen- 
dently fading signals all carrying the same information. The types of diversity that we 
described (time or frequency) are a form of channel coding usually called repetition coding 
where the code rate is 1/ D. Thus, if each information bit is transmitted twice in two widely 
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separated time slots or in two widely separated frequency bands, we have a dual diversity 
(D =2) system obtained with a repetition code of rate Re = 1/2. However, in general, a 
nontrivial code of rate 1/2 will yield significantly better performance if the coded bits are 
interleaved prior to transmission, as described in Chapter 13, so that the fading on each bit 
of a code word is statistically independent. In particular, a binary linear (n, k) code with 
minimum Hamming distance dmin results in a performance that is equivalent to a repetition 
code of diversity dmin when soft-decision decoding is used and dmin/2 when hard-decision 
decoding is used. Therefore, for any code rate 1/D, a nontrivial code can be selected which 
has a minimum Hamming distance dmin > D and, thus, provides a larger order of diversity 
than the corresponding repetition code of the same rate. 


Example 14.3.1 
Compare the error-rate performance of binary orthogonal FSK with dual diversity with the 
performance of the rate 1/2, dmin = 8, extended Golay (24, 12) code in which binary orthog- 
onal FSK is used to transmit each code bit. The channel is a Rayleigh fading channel and the 
receiver employs square-law combining and detection for both types of signals. 


Solution Let us assume that signal diversity is obtained by interleaving the coded bits so 
that we have statistically independent fading among the coded bits for both signals. Note that, 
the repetition code and the Golay (24, 12) code, are rate 1/2 codes. For the repetition code, we 
combine the square-law detected FSK signals in the two (interleaved) signal intervals. Hence, 
if a O is transmitted, the two metrics at the combiner output corresponding to a 0 and a 1, 
respectively, are 


ro = lo VB eti + nol? + lon VB eit +no?|’, 


2 2 
ri = |m? + lawl’, 


(14.3.25) 


where the {n;;} are statistically independent and identically distributed (i.i.d.) complex-valued, 
zero-mean Gaussian random variables. The probability of error is simply the probability that 
r2 > rj, and is given as 


B — (pp/2)° 
where the average SNR/bit is p». In the Golay code, there are 2! = 4096 code words, so that 
there are 4096 metrics at the output of the square-law combiner. To compute the probability 
of error, we may assume that the all-zero code word is transmitted. Then the combiner output 
corresponding to the all-zero code word is the metric 


24 
ro = Y` love! + no. (14.3.27) 


=1 


In the Golay (24, 12) code, there are 759 code words having distance dmin = 8 from the all-zero 
code words. Since any one of these code words differs in 8 bits from the all-zero code word 
and is identical with the all-zero word in 16 bits, the combiner output corresponding to any 
one of these 759 code words is statistically equivalent to the metric 


8 24 
n=) au + Do lov ei% + nol. (14.3.28) 
k=1 k=9 


Section 14.3 Performance of Binary Modulation in Rayleigh Fading Channels 791 


Hence, the difference between ro and r; is 


8 
n-n=)> [lov ei% + ngl? — Inu]. (14.3.29) 


k=1 


We observe that this difference is a function of summing the received signal over eight inde- 
pendently fading bits and, consequently, the code provides an order of diversity of dmin = 8. 
This implies that the error rate for the Golay code decays inversely as 5; i.e., the bit-error 
rate is 

P, ~ Z £ (14.3.30) 

2 SS , 3. 
PS (6,/2)° 

where K is a constant independent of SNR, 6, = 0/R, is the SNR/bit and R, is the code 
rate. Figure 14.15 illustrates the error probability for the two types of signals. Note that the 
Golay code outperforms the repetition code by over 8 dB at a bit-error probability of 10-*. 
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Figure 14.15 Comparison of performance of repetition code with Golay (24, 12) code. 
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The difference is even greater at lower error rates. In conclusion, a nontrivial code with inter- 
leaving generally provides more signal diversity than a repetition code of the same code 
rate. r 


v 


14.3.3 The RAKE Demodulator and Its Performance in Frequency Selective Channels 


Let us considerthe case in which the available channel bandwidth W exceeds the coherence 
bandwidth Bæ of the channel, and we transmit digital information at a symbol rate 1/T 
by modulating a single carrier frequency. We assume that the symbol duration T satisfies 
the condition T & Te. Consequently, the channel characteristics change very slowly in 
time, so that the channel is slowly fading, but it is frequency selective because W >> Bo. 
Furthermore, we assume that T > Tn so that ISI is negligible. 

Since W is the bandwidth of the bandpass signal, the bandwidth occupancy of the 
equivalent lowpass signal is W/2. Hence, we employ a band-limited lowpass signal s(t). 
Using the channel model for the frequency selective channel shown in Figure 10.4, we may 
express the received signal as 


L 
r(t)= J en(t)s(t —n/W) +n), (14.3.31) 


n=1 


where n(t) represents the AWGN. Therefore, the frequency selective channel provides 
the receiver with up to L replicas of the transmitted signal, where each signal component 
is multiplied by a corresponding channel tap weight c,(t), n = 1,2,..., L. Based on 
the slow fading assumption, the channel coefficients are considered as constant over the 
duration of one or more symbol intervals. 

Since there are up to L replicas of the transmitted signal s(t) in r(t), a receiver that 
processes the received signal in an optimum manner will achieve the performance that is 
equivalent to that of acommunication system with diversity equal to the number of received 
(resolvable) signal components. 

Let us consider binary signaling overthe channel. Suppose we have two equal energy 
signals sı (t) and s2(t), which are orthogonal, with a time duration T > Tm. Since the inter- 
symbol interference (ISI) is negligible, the optimum receiver consists of two correlators 
or two matched filters that are matched to the received signals. Let us use the correla- 
tor structure that is illustrated in Figure 14.16. The received signal is passed through a 
tapped delay-line filter with tap spacing of 1/W, as in the channel model. The number of 
taps is selected to match the total number of resolvable signal components. At each tap, 
the signal is multiplied with each of the two possible transmitted signals s(t) and s2(t), 
and, then, each multiplier output is phase corrected and weighted by multiplication with 
c*(t),n = 1,2,..., L. Then, the corresponding phase-aligned and weighted signal com- 
ponents are integrated over the duration of the symbol interval T and the two integrator 
outputs are sampled periodically every T sec. Their outputs are then sent to the detector. 
Thus, we have cross correlated the received signal with each of the two possible transmit- 
ted signals at all possible delays introduced by the channel. Note that the multiplication 
of the signal at each tap with the corresponding tap coefficient c(t) results in weighting 
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Figure 14.16 RAKE demodulator for signal transmitted through a frequency selective channel. 


the signal components by the corresponding signal strengths. Hence, the combining of the 
phase-corrected and weighted signal components corresponds to maximal ratio combining. 

In order to perform maximal ratio combining, it is necessary to estimate the channel- 
tap coefficients c, (t) from the received signal. Since these coefficients are time varying, it 
is necessary for the estimator to be adaptive, i.e., to be able to track the time variations. 

The demodulator structure shown in Figure 14.16 is called a RAKE demodulator. 
Because this demodulator has equally spaced taps with tap coefficients that essentially 
collect all the signal components in the received signal, its operation has been likened to 
that of an ordinary garden rake. 

Assuming thatthere are L signal components in the received signal, with correspond- 
ing signal strengths that are distinct and Rayleigh distributed, the probability of error for 
binary signals is well approximated as 
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L 
1 
P, = Ky I] ade (14.3.32) 


where (x is the average SNR for the kth-signal component; i.e., 
Bk = —E(a), (14.3.33) 


where a = |cx| is the amplitude of the kth-tap coefficient, y, = —1 for antipodal signals 
and y, = 0 for orthogonal signals, and Kz is the constant defined in Equation (14.3.22). In 
the special case where all the signal components have the same strength, the error probabil- 
ity for antipodal signals in Equation (14.3.32) reduces to that given by Equation (14.3.21) 
with D = L. 


14.3.4 OFDM Signal Transmission in Frequency Selective Channels 


From the viewpoint of communication system performance, the RAKE demodulator is the 
optimum demodulator for reception of a single carrier signal transmitted over a frequency 
selective channel in which the symbol duration is designed to satisfy the condition T > T,, 
and T & Ty; i.e., 


Tn KT K To. (14.3.34) 


Furthermore, for a frequency selective channel, the bandwidth of the transmitted signal 
satisfies the condition 


W > Bo. (14.3.35) 


The combination of the conditions in Equations (14.3.34) and (14.3.35) implies that 
TW > 1 or, equivalently, W > Ł. We note that the condition T >> Tm is imposed on 
the signal design to avoid the effects of ISI, which generally degrades the performance of 
the system. Consequently, the data (symbol) rate must be reduced in order to satisfy the 
condition T >> Tm or, equivalently, the bandwidth W of the transmitted signal is selected 
such that WT > 1. 

A more bandwidth-efficient modulation method for a frequency selective channel is 
OFDM, wherethe signal bandwidth W is subdivided into a large number N of subchannels 
for which the symbol duration of the signa transmitted on each subchannel is selected to 
satisfy the condition T >> Tm. Hence, ISI is rendered negligible in each subchannel and can 
be totally eliminated by the use of a time guard band of duration Tn or a cyc ic prefix. Thus, 
the frequency separation between adjacent subchannels is Af = 1/T and the number of 
subchannels is N = W/Af, so that each subchannel is basically characterized as frequency 
nonselective. To combat signal fading in this situation, the same information symbol can 
be transmitted in two or more subchannels separated in frequency by an amount equal to or 
exceeding the channel coherence bandwidth Be» in order to obtain signal diversity through 
statistical y independent fading. The following illustrative problem describes the OFDM 
signal design process. 
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Example 14.3.2 
Suppose a communication channel having a bandwidth W = 10 kHz is characterized as a 
multipath channel with multipath spread Tm = 10 ms and a Doppler spread Ba = 0.1 Hz. 
Select the parameters of an OFDM system such that the bandwidth loss due to the cyclic 
prefix (or time guard band) does not exceed 10%. 


Solution We may select the symbol duration to be T = 100 msec to satisfy the bandwidth 
loss constraint. Therefore, Af = 1/T = 10 Hz and, hence, the number of subchannels is 
N = 104/10 = 1000. The channel has a coherence time of Te = 1/Ba = 10 seconds, so we 
have satisfied the condition that T « Ta. The channel coherence bandwidth Bea = 1/ Tm = 
100 Hz. Tocombat signal fading in a subchannel, we may transmit the same symbol on multi- 
ple subchannels having a frequency separation of at least 100 Hz. Thus, the symbol throughput 
achieved on this channel is N/T = 10N symbols/sec without diversity and the throughput 
symbol rate Rs = 10N/D symbols/sec with diversity of order D. a 
Example 14.3.3 

Consider an HF channel which has a nominal bandwidth of 3200 Hz and a multipath spread 
of 7,, = 1 msec. Design a multiple-carrier OFDM signal that achieves a data rate of 4800 
bits/sec. 

Solution We may select the number N of subcarriers to be as large as we like so as to 
achieve the desired condition that Ts >> Tn, where Tsc is the symbol duration for each sub- 
carrier. However, the complexity of the demodulator increases, as N log, N [computational 
complexity of fast Fourier transform (FFT) algorithm] and the demodulation delay for deliv- 
ering the information to the user increases (linearly) with N. Therefore, it is desirable to keep 
N as small as possible. Suppose we select N such that Tse = 100 msec. Then, each subchannel 
may be as narrow! as Wy, © + = 10 Hz. Note that W,. < Bæ = 1000 Hz as desired. If we 
employ four-phase (PSK or DPSK) modulation in each subchannel, we achieve a bit rate of 
20 bits/sec, per subchannel. With N = 240 subchannels, we achieve the desired data rate of 
4800 bps. a 


14.4 MULTIPLE ANTENNA SYSTEMS 


The use of two or more antennas at the receiving terminal of a digital communication 
system is a commonly employed method for achieving spatial diversity and, thus, for mit- 
igating the effects of signal fading. Typically, the receiving antennas must be separated by 
one or more wavelengths to ensure that the received signals undergo statistically indepen- 
dent fading. Spatial receiver diversity is especially attractive because the signal diversity is 
achieved without expanding the signal transmission bandwidth. 

Spatial diversity can also be achieved by using multiple antennas at the transmit- 
ter. For example, we will demonstrate that it is possible to achieve dual diversity with 
two transmitting antennas and one receiving antenna. Furthermore, multiple transmitting 
antennas can be used to create multiple spatial channels and, thus, provide the capability 
to increase the data rate. This method is called spatial multiplexing. 

A communication system employing Nr transmit antennas and Np receive anten- 
nas is generally called a multiple-input, multiple-output (MIMO) system and the resulting 
spatial channel in such a system is called a MIMO channel. 


"In practice, it will be necessary to have some excess bandwidth in each subchannel. The excess bandwidth 
may be in the range of 15-25%. 
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The special case in which Nr = Nr = 1 is called a single-input, single-output 
(SISO) system, and the corresponding channel is called a SISO channel. A second special 
case is one in which Nr = 1 and Nr > 2. The resulting system is called a single-input, 
multiple-output (SIMO) system, and the corresponding channel is called a SIMO channel. 
Finally, a third special case is one in which Nr > 2 and Nr = 1. The resulting system is 
called a multiple-input, single-output (MISO) system, and the corresponding channel is a 
MISO channel. 


14.4.1 Channel Models for Multiple Antenna Systems 


In a MIMO system with Nr transmit antennas and Np receive antennas, we denote the 
impulse response between the jth transmit antenna and the ith receive antenna by c;;(T; t), 
where t is the delay variable and ¢ is the time variable in a general linear, time-varying 
channel. Thus, a randomly time-varying channel is characterized by the Nr x Nr matrix 
C(t; t), defined as 


Cnt) ci2(T;t) >>> Cin tee) 
c21(T;t) cats t) ++> Cony (B34) | 

C(t; t)= . . ; (14.4.1) 
Cnpi(t;t) Crng2(T; t) +++ Cogar (Ts É) | 


For a frequency nonselective channel, the channel matrix C is expressed as 


c(t) cyt) -+> cin, (t) 
cai(t) c(t) +++ can, (t) 

Cit) = . . . X (14.4.2) 
e E 48 A 


Suppose that the signal transmitted from the jth transmit antenna is sj(t), j = 1, 


2,..., Nr. Then the signal received at the ith antenna may be expressed as 
NT 
ri(t) =) ciy(t)sj(t), i =1,2,..., Ng (14.4.3) 
j=l 


and, in matrix form, the received signal vector r (t) is given as 


r(t) =C(t)s(), (14.4.4) 


where S(t) is an Nr x 1 vector and r(t) is an Nr x 1 vector. Furthermore, if the time 
variations of the channel impulse response are very slow within a time interval 0 < t < T, 
when T may be either the symbol interval or some general time interval, Panarion (14.4.4) 
may be simply expressed as 


r(t)=Cs(t), 0<t <T, (14.4.5) 


where C is constant within the time interval 0 < t < T. 
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The slowly time-variant frequency nonselective channel model embodied in Equation 
(14.4.5) is the simplest model for signal transmission in a MIMO channel. In this section, 
we employ this model to illustrate the performance characteristics of MIMO systems. 


14.4.2 Signal Transmission in a Slow Fading Frequency Nonselective MIMO Channel 


Consider a wireless communication system that employs multiple transmitting and receiv- 
ing antennas, as shown in Figure 14.17. We assume that there are N7 transmitting antennas 
and Np receiving antennas. As illustrated in Figure 14.17, a block of Nr symbols is con- 
verted from serial to parallel and each symbol is fed to one of Nr identical modulators, 
where each modulator is connected to a spatially separate antenna. Thus, the Ny symbols 
are transmitted in parallel and received on Np spatially separated receiving antennas. In 
this section, we assume that the Ny symbols are uncoded. 

Let us assume that each signal from a transmitting antenna to a receiving antenna 
undergoes frequency nonselective Rayleigh fading. We also assume that the differences in 
propagation times of the signals from the Npr transmitting to the Nr receiving antennas 
are small relative to the symbol duration T, so that for all practical purposes, the signals 
from the Nr transmitting antennas to any receiving antenna are synchronous. Hence, we 
can represent the received signals at the receiving antennas in a signaling interval as 


NT 
Tm(t) = J Sn€mn8p (t) + 2m(t), 0<t <T, m=1,2,..., NR, (14.4.6) 
n=1 
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Figure 14.17 A communication system with multiple transmitting and receiving antennas. 
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where g,(t) is the pulse shape (impulse response) of the modulation filters, Cmn is the 
complex-valued zero-mean Gaussian channel gain between the nth transmitting antenna 
and the mth receiving antenna, s, is the symbol transmitted on the nth antenna, and z,,(t) 
is a sample function of an AWGN noise process. The channel gains {Cmn} are modeled as 
identically distributed and statistically independent from channel to channel. The Gaussian 
sample functions {z,,(t)} are assumed to be identically distributed and mutually statisti- 
cally independent, each having zero mean and two-sided power spectral density No/2. The 
information symbols {Sn} are drawn from either a binary or M-ary PSK or QAM signal 
constellation. 

The demodulator for the signal at each of the Nr receiving antennas consists of 
a matched filter to the pulse g,(¢), whose output is sampled at the end of each symbol 
interval. The output of the demodulator corresponding to the mth receiving antenna can be 
represented as 


Nr 
Ym = Slr ge M= 1, 2, ..., NR, (14.4.7) 


n=1 


where the energy of the signal pulse g, (t) is normalized to unity and ^m is the additive 
Gaussian noise component. The Nr soft outputs from the demodulators are passed to 
the signal detector. For mathematical convenience, Equation (14.4.7) may be expressed 
in matrix form as 


y=Cs +n, (14.4.8) 


where y = [y1, Y2,- . -s YNgl S = [51, 52,..., Syr], 0 = [m, Mm, -. -> Mvp)’ and C is the 
Nr x Nr matrix of channel gains. Figure 14.18 illustrates the discrete-time model for the 
multiple transmitter and receiver signals in each signaling interval. 

In the formulation of a MIMO system as described above, we observe that the trans- 
mitted symbols on the Nr transmitting antennas overlap totally in both time and frequency. 
As a consequence, there is interchannel interference in the signals {Ym, 1 < m < Nr} 
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Figure 14.18 Discrete-time model of the communication system with multiple transmit and receive antennas 
in frequency nonselective slow fading channel. 
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received from the spatial channel. In the following section, we consider three different 
detectors for recovering the transmitted data symbols in a MIMO system. 


Example 14.4.1 
Suppose Nr = Nr = 2. Generate the elements of the channel matrix C for a Rayleigh fading 
AWGN channel and the corresponding inputs to the detectors at the two receive antennas. 


Solution The elements of C are c11, c12, €21, and cz). For the Rayleigh fading channel, these 
parameters are complex-valued, statistically independent, zero-mean Gaussian random vari- 
ables with identical variances a. Hence, the two inputs to the detectors at the two antennas are 


Yı = C1181 + C1252 +m, 
Ya = C218, + C2252 + M, 


where s; and s are the transmitted symbols from the two transmit antennas and (nı, 72) 
are the statistically independent additive Gaussian noise terms with zero mean and equal 
variances o? m 


14.4.3 Detection of Data Symbols in a MIMO System 


Based on the frequency nonselective MIMO channel model described in Section 14.4.2, we 
consider three different detectors for recovering the transmitted data symbols and evaluate 
their performance for Rayleigh fading and additive white Gaussian noise. Throughout this 
development, we assume that the detector knows the elements of the channel matrix C 
perfectly. In practice, the elements of C are estimated by using channel probe signals. 


Maximum-Likelihood Detector. The maximum -likelihood detector (MLD) is 
the optimum detector in the sense that it minimizes the probability of error. Because the 
additive noise terms at the Nr receiving antennas are statistically independent and iden- 
tically distributed (i.i.d.), zero-mean Gaussian, the joint conditional PDF f (y|s) is Gaus- 
sian. Therefore, the MLD selects the symbol vector $ that minimizes the Euclidean distance 
metric 


NR Nr 2 
us) = J |Ym— D> CmnSn (14.4.9) 
m=1 n=1 








Minimum Mean-Square-Error Detector. The minimum mean-square-error 
(MMSE) detector linearly combines the received signals {ym, 1 < m < Nr} to form an 
estimate of the transmitted symbols {s,, 1 < n < N7}. The linear combining is represented 
in matrix form as 


§=W"y, (14.4.10) 
where W is an Nr x Nr weighting matrix, which is selected to minimize the mean square 
error 


J(W) = Efilell?] = Ells — W” y1’. (14.4.11) 
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Minimization of /(W) leads to the solution for the optimum weight vectors w,, w2,... 
Wn, as 


’ 


w, = Ror  n=1,2,..., Np, (14.4.12) 


where Ryy = E[yy#] = CR,,C# + Nol is the (Ng x Np) autocorrelation matrix of 
the received signal vector y, Rss = E[ss"], rs, y = E[sžy] and E[ņņ”] = NoI. When 
the signal vector has uncorrelated, zero-mean components, Rss is a diagonal matrix. Each 
component of the estimate § is quantized to the closest transmitted symbol value. 


Inverse Channel Detector. The inverse channel detector (ICD) also forms an 
estimate of s by linearly combining the received signals {y,,1 < m < Nr}. However, 
in this case, we set Nr = Np and select the weighting matrix W so that the interchannel 
interference is completely eliminated; that is, w = C™! and, hence, 

§=C ly 
(14.4.13) 
=s +C! 


Each element of the estimate $ is then quantized to the closest transmitted symbol value. 
We note that the ICD estimate $ is not corrupted by interchannel interference. However, 
this also implies that the ICD does not exploit the signal diversity inherent in the received 
signal, as we will observe below. 

When Nr > Nr, the weighting matrix W may be selected as the pseudoinverse of 
the channel matrix; that is, 


We =©€*0)'e", (14.4.14) 
The three detectors are implemented in MATLAB in Computer Problem 14.9. 


14.4.4 Error Rate Performance of the Detectors 


The error rate performance of the three detectors in a Rayleigh fading channel is most 
easily evaluated by Monte Carlo computer simulation. 


Example 14.4.2 
Perform a Monte Carlo simulation to assess the errorrate performance of an (N7, Nr) MIMO 
system in a Rayleigh fading AWGN channel. The modulation is binary PSK (or binary PAM). 


Solution Figures 14.19 and 14.20 illustrate the binary error rate (BER) for binary PSK mod- 
ulation with (Nr, Nr) = (2, 2) and (Nr, Nr) = (2, 3), respectively. In both cases, the vari- 
ances of the channel gains are identical and their sum is normalized to unity; that is, 


> Ellen! = 1. (14.4.15) 


The BER for binary PSK modulation is plotted as a function of the average SNR per bit. With 
the normalization of the variances in the channel gains {Cmn} as given by Equation (14.4.15), 
the average received energy is simply the transmitted signal energy per symbol. 

The performance results in Figures 14.19 and 14.20 illustrate that the MLD exploits 
the full diversity of order Nr available in the received signal and, thus, its performance is 
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Figure 14.19 Performance of MLD, MMSE, and ICD (detectors) with Ng = 2 receiving antennas. 
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Figure 14.20 Performance of MLD and MMSE detectors with Ng = 3 receiving antennas. 
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comparable to that of a maximal ratio combiner (MRC) of the Nr received signals, without the 
presence of interchannel interference; that is, (Nr, Nr) = (1, Nr). The two linear detectors, 
the MMSE detector and the ICD, achieve an error rate that decreases inversely as the SNR 
raised to the (Nr — 1) power for Nr = 2 transmitting antennas. Thus, when Ng = 2, the 
two linear detectors achieve no diversity, and when Nr = 3, the linear detectors achieve dual 
diversity. We also note that the MMSE detector outperforms the ICD, although both achieve 
the same order of diversity. In general, with spatial multiplexing (Nr antennas transmitting 
independent data streams), the MLD detector achieves a diversity of order Nz and the linear 
detectors achieve a diversity of order Ng — Nr + 1, for any Nr > Nr. In effect, with Nr 
antennas transmitting independent data streams and Ne receiving antennas, a linear detector 
has Nr degrees of freedom. In detecting any one data stream, in the presence of Nr — 1 
interfering signals from the other transmitting antennas, the linear detectors utilize Ny — 1 
degrees of freedom to cancel the Nr — 1 interfering signals. Therefore, the effective order of 
diversity for the linear detectors is Ng — (Nr — 1) = Ne — Nr +1. E 


It is interesting to compare the computational complexity of the three detectors. We 
note that the complexity of the MLD grows exponentially as M*T, where M is the number 
of points (symbols) in the signal constellation, whereas the linear detectors have a com- 
plexity that grows linearly with Ny and Ne. Therefore, the computational complexity of 
the MLD is significantly larger than that of the linear detectors when Ny and M are large. 
However, fora small number of transmit antennas and small number of signal constellation 
symbols (i.e., Nr < 4 and M = 4), the computational complexity of MLD is reasonable. 


14.4.5 Space-Time Codes for MIMO Systems 


Let us now consider the MIMO system illustrated in Figure 14.21. At the transmitter, the 
sequence of information bits is fed into a symbol mapper that maps a block of bits into 
signal points {s;} selected from a signal constellation such as PAM, PSK, or QAM, con- 
sisting of M = 2° signal points. These signal points are fed as a block to a space-time 
encoder that maps the information symbols to a parallel set of identical modulators. In 
tum, the modulators map the signal points into corresponding waveforms that are trans- 
mitted simultaneously on the Ny antennas. Below, we describe two types of space-time 
codes: block codes and trellis codes. 


Space-Time Block Codes. A space-time block code (STBC) is defined by a 
generator matrix G, having N rows and Nr columns, of the form 


811 812 *** SINT 
821 822 °°: 82NT l 

G=|. (14.4.16) 
8N1 8N2 °*** 8NNT 


in whichthe elements {g;;} are signal points resulting from a mapping of information bits to 
corresponding signal points from a binary or M-ary signal constellation. By employing Nr 
transmit antennas, each row of G may contain up to Np different signal points (symbols), 
which are transmitted on the Nr antennas in a time slot. Thus, the first row of symbols in 
G is transmitted on the Ny antennas in the first time slot, the second row of symbols in G 
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Figure 14.21 Space-time block-coded MIMO system. 


is transmitted on the Ny antennas in the second time slot, and the Nth row of symbols in G 
is transmitted on the Nr antennas inthe Nth time slot. Therefore, N time slots are used to 
transmit the symbols in the N rows of the generator matrix G. The ratio of the number of 
different symbols transmitted to the number of time slots is called the spatial code rate Rs. 

In the design of the generator matrix of a STBC, it is desirable to focus on three 
principal objectives: (a) achieving the highest possible diversity of Nr Nr, (b) achieving 
the highest possible (throughput) rate, and (c) minimizing the complexity of the MLD. 
Our treatment considers these three objectives. 


The Alamouti STBC. Alamouti (1998) devised a STBC for Nr = 2 transmit 
antennas and Nr = 1 receive antenna. The generator matrix for the Alamouti code is 
given as 


G= ies - | l (14.4.17) 
2>"1 


where sı and sz are two signal points selected from an M-ary PAM, or PSK or QAM signal 
constellation with M = 2° signal points. Thus, 2b data bits are mapped into two signal 
points (symbols) sı and s2 from the M-ary signal constellation. The symbols sı and s2 
are transmitted on the two antennas in the first time slot and the symbols —s} and sf are 
transmitted on the two antennas in the second time slot. Thus, two symbols, sı and s2, are 
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transmitted in two time slots. Consequently, the spatial code rate R;, which is defined as 
the ratio of the number of symbols transmitted to the numbers of time slots used to transmit 
the symbols, is 1 for the Alamouti code. This is the highest possible rate for a (orthogonal) 
STBC. 

Let us assume that the symbols sı and sz are signal points in a QAM signal constella- 
tion. These signal points modulate the quadrature carriers cos 27 fet and sin 27 fet. Hence, 
the modulator output signals fed to the two antennas in the first time interval 0 < t < T are 


u (t) = Ame187 (t) cos 27 fet — Ams18r (t) sin 27 fet, 
i (14.4.18) 
uí D) = = Amc28r (t) cos 27 fet — Ams287(t) sin 27 fot, 
where s; = (VEs Amei VE; Amsi) „i = 1, 2, and g, (t) is a rectangular pulse, defined as 
ff O<t<T 
8 (t) = (14.4.19) 


otherwise _ 


The superscripts on u,,; (t) and u,,2(t) denote the signal transmitted in the first interval. In 
the second time interval (T < t < 2T), the signal points to be transmitted are —sj and ST. 
Hence the signals to be transmitted on the two antennas are 


u?) (t) = —Ame287 (t — T) cos 27 fe(t — T) — Ams287 (t — T) sin2nf,(t — T 


uÈ) = Ameigr (t — T) cos 27 felt — T) + Ams18r(t — T) sin27 fe(t — T). 
(14.4.20) 


The MISO channel matrix for the Nr = 2, Nr = 1 channel, based on a frequency 
nonselective model, is 


C= [eu c12] h (14.4.21) 


In the decoding of the STBC, we assume that C is constant over the two time slots and that 
it is known at the receiver. Hence, the received signal in the first time interval is 


rO) = cub e) + cru) +2) (14.4.22) 
and in the second time interval 
rO) = cu A) + eue) +2, (14.4.23) 


where n” (t) and n® (t) are the AWGN noise terms. 
At the receiver the, r”? (t) is cross correlated with the basis functions y; (t) and W2(t) 


Wilt) = 8r (t) cos 27 fet, 
palt) = —87 (t) sin 27t fet, 


(14.4.24) 
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and r® (t) is cross correlated with w(t — T) and W(t — T). Consequently, the outputs of 
the correlators at the sampling instants t = T and t = 2T for the two time slots are 


Yı = C1181 + C1252 +1, 
: i (14.4.25) 
y2 = —C11S2 +128; + 2, 


where 7, and ņ are zero-mean, circularly symmetric complex valued uncorrelated 
Gaussian random variables with equal variance o?. 
The correlator outputs y; and y2 in Equation (14.4.25) are fed to the detector, which 


computes the estimates of the symbols sı and s2 as follows: 


ŝi = yey, + y2¢12 
7 (14.4.26) 
S2 = yich = y3C11 


and it selects the symbol 5, and S, that are closest to $; and S2 in Euclidean distance. If we 
substitute for y; and y2 in (14.4.26) and carry out the multiplications, we obtain 


ŝi = [len]? + lcl] s1 + cfm + e120}, 
7 | A : (14.4.27) 
82 = [lc]? + lel? ] s2 + chm — crn}. 


We make the following observations on the Alamouti STBC. First, we observe that 
the code achieves dual diversity, which is the largest possible for transmission over the 
two transmit antenna and one receive antenna. Second, the MLD that computes Equation 
(14.4.26) is very simple. These two desirable properties were achieved as a result of the 
orthogonality characteristic of the generator matrix G for the Alamouti code, which we 
may express as 


G= E : A (14.4.28) 
—8&2 8 


We observe that the column vectors vı = (g1, —83)' and v2 = (82, gf) are orthogonal 
(i.e. vı - v = 0) and, furthermore, 


G"G = [lei + le2] bb (14.4.29) 


where I, is a 2 x 2 identity matrix. As a consequence of this orthogonality property, when 
we express the received signal given in Equation (14.4.25) as 


Yi |_| Cit €12 S1 m1 
lL LS] 
X2 Cin Cl s2 n2 


y=Cyst+y (14.4.30) 


or 
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and form the estimates 5, and $, as prescribed in Equation (14.4.26) from y in Equation 


(14.4.30), we obtain 
ŝi _ (en e |» 
82 Cin ~eu| (y3 


= CHOns + CH (14.4.31) 
= [len + len? ]s +CËn 
Therefore, 
CCa = [leu + lel] T2 (14.4.32) 


Thus, full diversity and low decoding complexity are achieved as a consequence of the 
property of G given in Equation (14.4.29). 


Example 14.4.3 

Perform a Monte Carlo simulation to estimate the error rate performance of an Nr = 2, 
Nr = 1 multiple antenna system thatemploys the Alamouti STBC. Hence, generate the input 
to the detector as given in Equation (14.4.25), where the signal points are selected from a 
QPSK constellation; c,, and cj are statistically independent, complex-valued, zero-mean 
Gaussian random variables with unit variance; and 7, and m2 are also statistically indepen- 
dent, complex-valued, zero-mean Gaussian random variables with variance o?. The detector 
computes the estimates as in Equation (14.4.26) and decides on which symbols are closest 
to §, and $, in Euclidean distance. Perform the above computations for N = 10,000 itera- 
tions for any given value of o?, where in each iteration the channel coefficients (c11, ciz), the 
signal points (sı, 52), and, additive noise terms (7), 2) are selected independently.. Plot the 
measured symbol error rate as a function of the SNR = 10 log, (€,/207), where €, = €,/2 
is the energy per bit, which may be normalized to unity for convenience. 


Solution The graph for the estimated error rates as a function of SNR is shown in Figure 
14.22. We observe that the probability of error decreases inversely as the square of the SNR, 
i.e., the Alamouti code yields dual diversity when (N7, Nr) = (2, 1). | 


The Alamouti code is an example of orthogonal complex matrix design for Nr = 2. 
It has been shown in the literature [see Jafarkhani (2005) and Tarokh et al. (1999)] that 
orthogonal complex matrix designs with R, = 1 do not exist for Nr > 2 transmit anten- 
nas. However, by dropping the constraint that the generator matrix G be square, it is pos- 
sible to devise orthogonal designs for either one-dimensional or two-dimensional signal 
constellations. For example, an orthogonal generator matrix for a STBC that transmits four 
complex-valued (PSK or QAM) symbols on Nr = 4 transmit antennas is 


- -; 


S1 S2 S3 S4 
—S2 S1 —S4 S3 


G= s* st st gt (14.4.33) 
D 2, 83.6 4 
* * * * 
=s si s] s3 
* * * * 
—S3 s] ST -Š 
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Figure 14.22 Plot for Example 14.4.3. 


For this code generator, the four complex-valued symbols are transmitted in eight consec- 
utive time slots. Hence the spatial rate for this code is R, = 1/2. We also observe that 


4 
GiG= iti! I, (14.4.34) 


i=l 


so that this code provides fourth-order diversity in the case of one receive antenna and 4NR 
diversity with Nr receive antennas. 

Complex orthogonal matrices with rate Rẹ = 1/2 exist for any number of transmit 
antennas. However, Wang and Xia (2003) have shown that complex orthogonal matrices 
for rates R, > 3/4 do not exist. Rate R, = 3/4 complex orthogonal matrices do exist. The 
following Rs = 3/4 complex orthogonal generator matrix is designed for Ny = 3 transmit 
antennas in which the three symbols sı, s2, and s3 are transmitted in four time slots and 
determine the order of diversity achieved by the STBC. 


S1 82. 83 

_ |; st 0 
G= $0 ost (14.4.35) 

0 s =s} 


Example 14.4.4 


Express the input to the detector for the rate R, = 3/4 code having the generator matrix given 
by Equation (14.4.35) and determine the expression for the estimates $1, $2, and §; computed 
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by the detector. Assume that the channel coefficients c11, c12, C13 are time invariant over the 
four time slots and determine the order of diversity achieved by the STBC. 


Solution The inputs to the detector are the outputs of the correlators for the signals received 
in the four time slots, i.e., 


Yı = C1151 + C1252 + 1353 + Ni 
Yo = —Ci153 + cast +m 

y3 = cs} — CST + m 

Y4 = C1283 — C85 + N4 


or, equivalently, 


yı Ci Cn C13 $ nı 
1 
zj |c -ù 0 nz 
* | | _p* 0 * s2 + 
y3 C13 ah S n3 
yí 0 —=ch ch n4 
y=C3s+9 


It is easily verified that the symbol estimates are obtained from the following linear combina- 
tions of y1, Y2, Y3, Ya: 


“a * * * 
Sı = Cji Yi + Cryo — C133 
a * * * 
S2 = C121 — C12 — C134 
A * 
$3 = cayi + cn y3 + C124 


or, in matrix form, 
^ H 
S = C319. 


By substituting in the equation for y we obtain 
s= CË Cais + Cin 
= [len + lel? + leisi?] 8 + Csin- 


Therefore, this STBC results in third-order diversity. | 


Space-Time Trellis Codes. Space-time trellis codes (STTCs) are similar to trel- 
lis codes in trellis-coded modulation (TCM) in that they are formed by combining a trellis 
code with an appropriately selected signal constellation with the objective of achieving 
a coding gain. In the case of a space-time code, the primary objective is to achieve the 
largest possible spatial diversity at the highest coding rate. STTCs may be designed either 
manually or with the aid of a computer, following some simple rules, similar in nature to 
the rules for designing trellis codes for TCM. 

As an example of a STTC, we consider the four-state trellis code shown in Figure 
14.23, which is designed for two transmit antennas and QPSK modulation. The states are 
denoted as S, = 0, 1, 2, 3. The input to the encoder is a pair of bits (00, 01, 10, 11) that 
are mapped into the corresponding phases, which are numbered (0, 1, 2, 3), respectively. 
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The indices 0, 1, 2, 3 correspond to the four phases, which are called symbols. Initially, 
the encoder is in state S, = 0. Then, for each pair of input bits that are mapped into 
a corresponding symbol, the encoder generates a pair of symbols, the first of which is 
transmitted on the first antenna and the second of which is transmitted simultaneously on 
the second antenna. For example, when the encoder is in state S, = 0 and the input bits are 
11, the symbol is a 3. The STTC outputs the pair of symbols (0, 3), corresponding to the 
phases 0 and 31/2. The zero phase signal is transmitted in the first antenna and the 31/2 
phase signal is transmitted on the second antenna. At this point the encoder goes to state 
Sı = 3. If the next two input bits are 01, the encoder outputs the symbols (3, 1), which are 
transmitted on the two antennas. Then the encoder goes to state S, = 1, and this procedure _ 
continues. At the end of a block of input bits, say a frame of data, zeros are inserted in the 
data stream to return the encoder to the state S, = 0. Thus the STTC transmits at a bit rate 
of 2 bps/Hz. We note that it satisfies the two design rules given above and achieves full 
rank of Nr = 2. 

In decoding a STTC, the maximum-likelihood sequence detection (MLSD) criterion 
provides the optimum performance. MLSD is efficiently implemented by use of the Viterbi 
algorithm. For two transmit antennas., the branch metrics may be expressed as 


NR 


olsi, 82) = È [Yj — cij 81 — Cay sal’, (14.4.36) 
j=l 


where {y;, 1 < j < Np} are the outputs of the matched filters at the Nr receive antennas, 
{c1j,1 < j < Nr} and {c;, 1 < j < Np} are the channel coefficients in a frequency nons- 
elective channel, and (s1, $2) denote the symbols transmitted on the two antennas. By using 
these branch metrics in the Viterbi algorithm to form the path metrics of valid paths through 
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the trellis, we can find the path that minimizes the overall metric and, thus, determine the 
sequence of transmitted symbols corresponding to the path having the smallest path metric. 


14.5 LINK BUDGET ANALYSIS FOR RADIO CHANNELS 


In the design of radio communication systems that transmit over LOS microwave satellite 
channels, we must also consider the effect of the antenna characteristics on determining the 
SNR at the receiver that is required to achieve a given level of performance. The system 
design procedure is described next. 

Suppose that a transmitting antenna radiates isotropically in free space at a power 
level Pr Watts, as shown in Figure 14.24. The power density at a distance d from the 
antenna is Pr /4nd? W/m”. If the transmitting antenna has directivity in a particular direc- 
tion, the power density in that direction is increased by a factor called the antenna gain 
Gr. Then, the power density at a distance d is PrGr/4nd? W/m?. The product PrGr 
is usually called the effective isotropically radiated power (EIRP), which is basically the 
radiated power relative to an isotropic antenna for which Gr = 1. 

A receiving antenna pointed in the direction of the radiated power gathers a portion 
of the power that is proportional to its cross-sectional area. Hence, the received power 
extracted by the antenna is expressed as 


p, _ PrGrAr 
R Gnd?’ 


where Ap is the effective area of the antenna. The basic relationship between the antenna 
gain Gr and its effective area, obtained from basic electromagnetic theory, is 


(14.5.1) 


G 2 
AR => RA , m? 
4n 


where A is the wavelength of the transmitted signal. 





(14.5.2) 





Figure14.24 Antenna that radiates isotropically in free space. 
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If we substitute for A g from Equation (14.5.2) into Equation (14.5.1), we obtain the 
expression for the received power: 


_ PrGrGr 


> Gait (14.5.3) 


R 


The factor (4nd/A)* = Z, is the free-space path loss. Other losses that may be encoun- 
tered in the transmission of the signal, such as atmospheric losses, are accounted for by 
introducing an additional loss factor %. Therefore, the received power may be 
expressed as 


PrGrGR 
Pr = — 14.5.4 
R ZE ( ) 
or, equivalently, 
Prlasw = Prlasw + Gries + Gries — Solan — Zalas- (14.5.5) 


The effective area for an antenna generally depends on the wavelength A of the radi- 
ated power and the physical dimension of the antenna. For example, a parabolic (dish) 
antenna of diameter D has an effective area 


xD? 
Ar ==, (14.5.6) 


where n D?/4 is the physical area and y is the illumination efficiency factor, which is typ- 
ically in the range 0.5 < n < 0.6. Hence, the antenna gain for a parabolic antenna of 
diameter D is 


D\2 
GR = (2) , fora parabolic antenna. (14.5.7) 


As a second example, a horn antenna of physical area A has an efficiency factor of 
0.8, an effective area of Ag = 0.8A, and a gain of 


Gr = cm for a horn antenna. (14.5.8) 

Another parameter that is related to the gain (directivity) of an antenna is its 
beamwidth, which is denoted as ©g and illustrated in Figure 14.25. Usually, the beamwidth 
is measured as the —3 dB width of the antenna pattern. For example, the —3 dB beamwidth 
of a parabolic antenna is approximately 


©ps ~ 70A/D deg, 


so that Gr is inversely proportional to @%. Hence, a decrease of the beamwidth by a factor 
of two, which is obtained by doubling the diameter, increases the antenna gain by a factor 
of four (6 dB). 
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Beamwidth @g 


(a) Beamwidth 


Figure 14.25 A narrow beam 
(b) Antenna pattern antenna and its radiating pattem. 


Example 14.5.1 


A satellite in geosynchronous orbit (36,000 km above the earth’s surface) radiates 100 W 
of power (20 dBW). The transmitting antenna has a gain of 18 dB, so that the EIRP = 38 
dBW. The earth station employs a 3-m parabolic antenna, and the downlink is transmitting at 
a frequency of 4 GHz. Determine the received power. 


Solution The wavelength à = 0.075 m. Hence, the free-space path loss is 
4nd 
Flan = 20log (=) = 195.6 dB. 


Assuming that 7 = 0.5, the antenna gain is 39 dB. Since no other losses are assumed, 
it follows that 


Prlasw = 20+ 18+ 39 — 195.6 
= —118.6 dBW, 
or equivalently, 
Pr = 1.38 x 10°? W. 
We may carry the computation one step further by relating the €,/No required to 
achieve a specified level of performance to Pr. Since 
Gp E T, PR = 1P R 
No No RN’ 





(14.5.9) 
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P 
Z-k (2) , (14.5.10) 
No No req 


where (6s /No)req is the required SNR/bit to achieve the desired performance. The relation in 
Equation (14.5.10) allows us to determine the bit rate R,. We have 


P 
10 logy Ry = (2) — 10 logig (2) ; (14.5.11) 
dB req 


it follows that 


Example 14.5.2 


If (Gs/No)req = 10 GB, determine the bit rate for the satellite communication system in 
Example 14.5.1. Assume that the front end of the receiver has a noise temperature of 300 K, 
which is typical for a receiver in the 4-GHz range. 


Solution Since J) = 290 K and T, = 10 K, it follows that 
No = kT = 4.1 x 10-7! W/Hz, 


or equivalently, —203.9 dBW/Hz. Then, 


(5) = —118.6 + 203.9. 
0/ dB 


Therefore, from Equation (14.5.11), we obtain 


10 logio Roe = 85.3 — 10 
= 75.3, 


or equivalently, 
R, = 33.9 x 10° bps. 


We conclude that this satellite channel can support a bit rate of 33.9 Mbps. a 


14.6 SUMMARY AND FURTHER READING 


In this chapter, we treated digital signal transmission in fading multipath channels. We 
began with the characterization of physical wireless channels and constructed mathemat- 
ical models for frequency selective and frequency nonselective fading channels. We also 
presented models for the Doppler effects that account for a frequency spread in the received 
signal due to motion that occurs in the physical channel or in a moving platform, such as 
a receiver in mobile communications. Propagation models for mobile radio channels that 
account for path loss and shadowing were also described. 

In Section 14.3, we evaluated the probability of error of binary modulation schemes 
in frequency nonselective Rayleigh fading channels with additive white Gaussian noise 
(AWGN). We observed that in such channels, the probability of error decreases inversely 
with signal-to-noise ratio (SNR), which means that a large SNR is required to achieve 
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a sufficiently small error probability. On the other hand, we demonstrated that the SNR 
required to achieve good performance can be significantly reduced by signal diversity, 
which can be achieved by transmitting the same symbol on multiple independently fading 
channels. We also demonstrated that signal diversity is achieved in a more bandwidth- 
efficient manner by employing coding, provided that the coded symbols are interleaved in 
either time or frequency or both, so that they undergo independent fading. For frequency 
selective fading channels, we described the RAKE demodulator, which is optimum in the 
sense of collecting and summing the energies in the resolvable multipath signal compo- 
nents. Thus, the RAKE demodulator achieves signal diversity from the processing of the 
resolvable signal components in the received signal. In this section, we also demonstrated 
that OFDM may be used instead of single carrier transmission in a frequency selective 
channel. The primary objective in employing OFDM rather than single carrier is that each 
subchannel in OFDM is designed to be sufficiently narrowband so that it becomes fre- 
quency nonselective. Then, to combat signal fading on the individual subchannels, the 
same symbol is transmitted on two or more subchannels separated in frequency by the 
coherence bandwidth of the channel. Thus, signal diversity is achieved. 

Our evaluation of the error rate performance of binary modulation techniques in fad- 
ing channels was focused on the Rayleigh fading channel model. Although other statistical 
models, such as the Rician and Nakagami fading models, may be more appropriate for 
characterizing fading on some real channels, the general approach for designing reliable 
digital communications via signal diversity is still applicable. 

In Section 14.4, we introduced the reader to the use of multiple transmit and receive 
antennas for increasing the transmission rate and obtaining signal diversity through spa- 
tial multiplexing in wireless communication systems. In particular, we presented detection 
methods for use in multiple antenna systems and evaluated their performance in Rayleigh 
fading channels. Methods for mapping digital signals for transmission on multiple anten- 
nas were also presented, including space-time block codes, such as the Alamouti code, and 
trellis codes. 

The final topic treated in this chapter is link budget analysis for digital communica- 
tion on radio channels. The link budget analysis described in the last section of this chapter 
applies generally to free space line-of-sight (LOS) channels and is especially relevant to 
the design of LOS microwave radio communications and satellite communication systems. 
Radio propagation in non-LOS terrestrial channels is significantly more variable due to 
terrain characteristics and, consequently, signal attenuation is generally greater than in free 
space. For example, the modeling of path losses in cellular radio communications is treated 
in the book by Rappaport (1996). 

More extensive and advanced treatments in fading multipath channels are found in 
the books by Schwartz et al. (1966) and Proakis and Salahi (2008). The pioneering work on 
the characterization of fading multipath channels and on the signal and receiver design for 
reliable digital communication over such channels was done by Price (1954, 1956). This 
early work was followed by significant contributions from Price and Green (1958, 1960), 
Kailath (1960, 1961), and Green (1962). Diversity transmission and diversity combining 
techniques under a variety of channel conditions have been treated by Pierce (1958), Bren- 
nan (1959), Turin (1961, 1962), Pierce and Stein (1960), Barrow (1963), Bello and Nellin 
(1962a, 1962b, 1963), Price (1962a, 1962b), and Lindsey (1964). 
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PROBLEMS 


14.1 


14.2 


14.3 


In the transmission and reception of signals to and from moving vehicles, the trans- 
mitted signal frequency is shifted in direct proportion to the speed of the vehicle. The 
so-called Doppler frequency shift imparted to a signal that is received in a vehicle 
traveling at a velocity v relative to a (fixed) transmitter is given by the formula 


Vv 
fp =bn 


where A is the wavelength, and the sign depends on the direction (moving toward or 
moving away) that the vehicle is traveling relative to the transmitter. Suppose that a 
vehicle is traveling at a speed of 100 km/hour relative to a base station in a mobile 
cellular communication system. The signal is a narrowband signal transmitted at a 
carrier frequency of 1 GHz. 


1. Determine the Doppler frequency shift. 


2. What should be the bandwidth of a Doppler frequency tracking loop if the loop 
is designed to track Doppler frequency shifts for vehicles traveling at speeds up 
to 100 km/hr? 


3. Suppose the transmitted signal bandwidth is 1 MHz centered at 1 GHz. Deter- 
mine the Doppler frequency spread between the upper and lower frequencies in 
the signal. 


A multipath fading channel has a multipath spread of T,, = 1 sec and a Doppler 
spread Ba = 0.01 Hz. The total channel bandwidth at bandpass available for signal 
transmission is W = 5 Hz. To reduce the effect of intersymbol interference, the 
signal designer selects a pulse duration of T = 10 sec. 


. Determine the coherence bandwidth and the coherence time. 
. Is the channel frequency selective? Explain. 


. Is the channel fading slowly or rapidly? Explain. 


Aa U N = 


. Suppose that the channel is used to transmit binary data via (antipodal) coher- 
ently detected PSK in a frequency diversity mode. Explain how you would use 
the available channel bandwidth to obtain frequency diversity and determine 
how much diversity is available. 


5. For the case in (4), what is the approximate SNR required/diversity to achieve 
an error probability of 1076? 


Determine an appropriate channel model for an airplane-to-airplane communica- 
tion link in which there is a direct signal propagation path, and a secondary prop- 
agation resulting from signal scattering due to the surrounding ground terrain. The 
secondary path has a propagation delay of to = 10 usec relative to the propagation 
delay of the direct path. The signal bandwidth is W = 100 kHz. 
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Determine the appropriate channel model for the airplane-to-airplane communica- 
tion link described in Problem 14.3, but now assume that the transmitted signal 
bandwidth is 10 kHz. 


The probability of error for binary DPSK and binary FSK with noncoherent detec- 
tion in an AWGN channel is 


1 
P, = —e >, 
aa, 





where c = 1 for DPSK and c= 5 for FSK, and p, = ae , where g is the attenuation 
factor. By averaging Pp, over the Rayleigh distributed variable œ, as indicated by 
Equation (14.3.9), verify the expression for the probability of error for DPSK and 


FSK in a Rayleigh fading channel. 


A communication system employs dual antenna diversity and binary orthogonal 
FSK modulation. The received signals at the two antennae are 


nt) =s t) +n (t), 
n (t) = œs (t) +m (t), 


where a and œz are statistically i.i.d. Rayleigh random variables, and nı (t) and 
n2(t) are statistically independent, zero-mean white Gaussian random processes 
with power-spectral density No /2 W/Hz. The two signals are demodulated, squared, 
and then combined (summed) prior to detection. 


1. Sketch the functional block diagram of the entire receiver including the demod- 
ulator, the combiner, and the detector. 


2. Plot the probability of error for the detector and compare the result with the case 
of no diversity. 


A binary communication system transmits the same information on two diversity 
channels. The two received signals are 


ri = ŁV6p +M, 
ro = EV 6p + M, 


where E (nı) = E(n2) = 0, E(n?) = o? and E (nî) = 02, and n andn are uncor- 
related Gaussian variables. The detector bases its decision on the linear combination 
of rı and rg, i.e., 


r=rnt+kro. 


1. Determine the value of k that minimizes the probability of error. 


2. Plot the probability of error for o? = 1, of = 3 and either k = 1 or k is the 
optimum value found in Part 1. Compare the results. 


Problems : 817 


14.8 


14.9 


Suppose the binary antipodal signals +s(t) are transmitted over a fading channel 
and the received signal is 


r(t)=cttas(t)+n(t), O<t<T, 


where n(t) is zero-mean white Gaussian noise with autocorrelation function Ms (T). 


The energy in the transmitted signal is 8 = fý |s(t)|? dt. The channel gain a is 
specified by the PDF p(w) =0.15(a@) + 0.96(@ — 2). 


1. Determine the average probability of error P, forthe demodulator which employs 
a filter matched to s(t). 


2. What value does P, approach as approaches infinity? 


3. Suppose.the same signal is transmitted over two statistically independently fad- 
ing channels with gains a and œ2, where 


plak) = 0.16 (a) + 0.96(a, — 2), k=1,2. 


The noises on the two channels are statistically independent and identically dis- 
tributed. The demodulator employs a matched filter for each channel and simply 
adds the two filter outputs to form the decision variable. Determine the aver- 
age P}. 


4. For the case in (3) what value does P, approach as a approaches infinity? 


Consider the frequency nonselective channel model shown in Figure 14.4. Binary 
orthogonal signals u; (t) and u2(t) are used to transmit information over this chan- 
nel. The symbol duration Tp « Ter, so that the channel is assumed to be constant 
over a time interval (time window) of NT,, where N is a positive integer. The out- 
puts of the matched filters at the sampling instants are either 


Yık = cv Sy + 1K 


or 


Y2k = Nk, 


when ux is transmitted or 


yık = Nik 
Yak = CY Êb + Nak 


when u(t) is transmitted. c = ae/® is the complex-valued channel coefficient, 
which is assumed to be constant over the time window 0 < t < NTp. 


To detect the signal coherently over the time window, the channel coefficient c is 
estimated by averaging the matched filter outputs for 1 < k < N. Thus, the esti- 
mate is 
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1 


NVEp 


1 N 
Ne [ev B + mu +m, 


Assuming that the noise terms are statistically independent, zero-mean Gaussian 
random variables with equal variance o? = No /2, (a) show that the mean value of 
ĉ is the actual channel coefficient c, and (b) show that the variance of the estimate 
decreases as N is increased. (c) If o? is defined as the variance of the estimate and 
1/a2 is defined as the SNR of the estimate, determine the expression for 1/02 and 
thus show that the SNR increases linearly with N and with €». 





c= 


N 
X One + yar) 
k=] 





Suppose that an HF channel with a nominal bandwidth allocation of 3200 Hz is to 
be used for transmitting digital information at a rate of either (1) 4800 bits/sec or 
(2) 20 bits/sec. The channel multipath spread is T,,, = 5 msec. Specify a modulation 
method for achieving the desired data rates and indicate whether or not an equalizer 
is necessary at the receiver for the intersymbol interference. 


Repeat Example 14.2.6 for a train traveling at a speed of 200 km/hour and a carrier 
frequency of 1 GHz. 


Repeat Example 14.3.2 for a channel with bandwidth W = 800 kHz, multipath 
spread Tm = 10 us and a Doppler spread Bg = 10 Hz. 


Show that the conditions in (14.3.34) and (14.3.35) imply that the time-bandwidth 
product TW > 1. 


The generator matrix for a MISO system with (Nr, Nr) = (4, 1) antennas is given 
as follows: 


S 2 83 0 
* ox 
Ga| 72 si 0 83 
s} 0 sf s 
0 sz =s} -s 


Thus, this STBC results in a spatial rate R, = 3/4. The channel matrix for this 
system consists of the elements cj, €12, €13, and c14. Show that this generator matrix 
is orthogonal and results in a fourth-order diversity when used in a fading channel. 


A radio transmitter has a power output of Pr = 1 Watt at a frequency of 10° Hz 
(1 GHz). The transmitting and receiving antennas are parabolic dishes with a diam- 
eter D=3 m. 

1. Determine the antenna gains. 

2. Determine the EIRP for the transmitter. 


3. The distance (free space) between the transmitting and receiving antennas is 
20 km. Determine the signal power at the output of the receiving antenna in dBm. 
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14.16 A radio communication system transmits at a power level of 0.1 Watt at 1 GHz. The 
transmitting and receiving antennas are parabolic, and each has a diameter of 1 m. 
The receiver is located 30 km from the transmitter. 


1. Determine the gains of the transmitting and receiving antennas. 
2. Determine the EIRP of the transmitted signal. 


3. Determine the signal power from the receiving antenna. 


14.17 A satellite in synchronous orbit is used to communicate with an earth station at a 
distance of 4 x 10’ m. The satellite has an antenna with a gain of 15 dB anda 
transmitter power of 3 W. The earth station uses a 10-m parabolic antenna with an 
efficiency of 0.6. The frequency band is at f = 10 GHz. Determine the received 
power level at the output of the receiver antenna. 


14.18 A spacecraft located 108 m from the earth is sending data at a rate of R bps. The 
frequency band is centered at 2 GHz and the transmitted power is 10 W. The earth 
station uses a parabolic antenna 50 m in diameter, and the spacecraft has an antenna 
with a gain of 10 dB. The noise temperature of the receiver front end is T = 300° K. 


1. Determine the received power level. 


2. If the desired €,/No = 10 dB, determine the maximum bit rate that the space- 
craft can transmit. 


14.19 Consider the front end of the receiver that is shown in the block diagram in Figure 
P-14.19. The received signal power at the input to the first amplifier is —113 dBm, 
and the received noise power spectral density is —175 dBm/Hz. The bandpass filter 
has a bandwidth of 10 MHz, and gains and noise figures are as shown. Determine 
the signal-to-noise ratio P,/P, at the input to the demodulator. 


Bandpass To 
filter demodulator 











G=10dB G = —1 dB G = 25 dB 
F=5dB —___ | F=2dB F=5dB 
Local 
oscillator 
Figure P-14.19 


14.20 A satellite in geosynchronous orbit is used as a regenerative repeater in a digital 
communication system. Consider the satellite-to-earth link in which the satellite 
antenna has a gain of 6 dB and the earth station antenna has a gain of 50 dB. The 
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downlink is operated at a center frequency of 4 GHz, and the signal bandwidth is 1 
MHz. If the required (€,/No) for reliable communication is 15 dB, determine the 
transmitted power for the satellite downlink. Assume that No = 4.1 x 10-2! W/Hz. 


14.21 One of the Mariner spacecrafts that traveled to the planet Mercury sent its data 
to earth over a distance of 1.6 x 10!! m. The transmitting antenna had a gain of 
27 dB and operated at a frequency f = 2.3 GHz. The transmitter power was 17 W. 
The earth station employed a parabolic antenna with a 64-meter diameter and an 
efficiency of 0.55. The receiver had an effective noise temperature of T, = 15° K. 
If the desired SNR/bit (€,/No) was 6 dB, determine the data rate that could have 
been supported by the communication link. 


COMPUTER PROBLEMS 


14.1 Simulation of a Two-Path Rayleigh Fading Channel 


Figure CP-14.1 illustrates a two-path channel model where the time delay between 
the two paths is Tz. The tap weights cı[n] and c2[n] are statistically independent, 
zero-mean, complex-valued Gaussian random variables. Hence, the channel is a 
Rayleigh fading channel. The additive noise is a zero-mean, complex-valued white 
Gaussian noise sequence w[n] = w,[n] + jwi[n], where the real and imaginary 
components are statistically independent. Write a MATLAB program that simulates 
the channel model with the following conditions: 


1. The tap weight sequences cı[n] and c2[n] are the outputs of two identical low- 
pass filters described by the difference equation 


cin] = 0.9c[n — 1] + zk[n] k= 1,2, 


where z,[7] = zę [n] + jzki[n] and the real and imaginary components of z,[n] 
are statistically independent, zero-mean, and unit variance white Gaussian noise 
sequences. 

2. The additive white Gaussian noise sequence has zero mean and variance oĉ 
(a variable). 

3. The time delay T4 is a positive integer, which is also a variable. 


Simulate the appropriate channel model for a two-path ionospheric propagation 
channel in which the relative time delay between the two received signal paths is 
1 msec and the transmitted signal bandwidth is W = 10 kHz. Note that a 10 kHz 
signal provides a time resolution of 1/W = 0.1 msec, so that the time delay Tz = 10 
signal samples corresponds to the delay of 1 msec. 


Generate and plot {c;[m[} and {c2[n[} separately for 1 < n < 1000. Compute and 
plot the channel output {y[n[} when the input sequence x[n] = 1 for 1 < n < 1000, 
for each value of ø = 0, 0.5, 1. 


Repeat this simulation when the signal bandwidth is reduced to 5 kHz. 
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Additive noise 
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Figure CP-14.1 


14.2 


14.3 


Simulation of Antipodal Signals in Rayleigh Fading Channel 


Perform a Monte Carlo simulation to estimate and plot the error probability of 
a binary antipodal signaling communication system in frequency-nonselective 
Rayleigh fading. Thus, simulate the input to the detector as given in Equation 
(14.3.7), where the additive noise is Gaussian, with zero mean and unit variance and 
a Rayleigh distributed with ø? selected to be unity. Then vary the average SNR p, 
in the simulation. Plot the estimated error probability and the theoretical value given 
in Equation (14.3.10). Select the sample size for the simulation to be N = 10,000. 


Simulation of Orthogonal Signals in Rayleigh Fading with Dual Diversity 


A digital communication system for transmitting information through a frequency- 
nonselective Rayleigh fading AWGN channel employs dual diversity by transmit- 
ting each information bit on two carrier frequencies having a separation that exceeds 
the coherence bandwidth of the channel. Thus, the two signals fade independently. 
The signals used for transmission on each carrier frequency are orthogonal. There- 
fore, the correlator outputs for the orthogonal signals, when the transmitted infor- 
mation bit is a 1, is 

ri = aiy Bpel +n, 

ri2 = ni2 


and 
ra = aay Eze!” + no, 
r22 = N22, 


where œ; and œ are statistically independent Rayleigh distributed random variables 
and {n;;,i = 1, 2, j = 1, 2} are mutually statistically independent complex-valued 
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Gaussian random variables with zero mean and unit variance. The correlator per- 
forms square law combining of the two signals, that is, it computes 


2 2 

Ry = |rul’ + Irail’, 
2 2 

R2 = |nial* + Iraal 


and feeds Rı and R, to the detector, which decides in favor of the detected bit 
corresponding to the larger of (Ri, R2). Perform a Monte Carlo simulation to esti- 
mate and plot the error probability for the dual diversity system as a function of 
the SNR/bit in dB. Select the sample size for the simulation to be N = 100,000. 
For comparison, also plot the theoretical values of the error probability given by 
Equation (14.3.23) for large SNR, say ð > 15 dB. Note that 9, = Dp, where D is 
the order of diversity. 


Simulation of Rayleigh Distributed Random Variables 

Repeat Example 14.2.3 when the parameter o? in the Rayleigh distribution takes on 
the values o? = 1, 5, 10. 

RAKE Demodulator for Two-Path Channel 


Perform a Monte Carlo simulation to estimate and plot the error probability of a 
binary antipodal signaling communication system in which the channel is charac- 
terized by two resolvable Rayleigh fading signal paths. Thus, the received signal in 
the interval 0 < t < T is, for a slowly fading channel, 


r(t) = cis (t) + cs (t —1/W) +n(t), 


where cı and c are uncorrelated, complex-valued Gaussian random variables with 
zero mean and unit variance, and n(t) is a complex-valued AWGN process. The 
received signal is passed through a RAKE demodulator that cross correlates r(t) 
with s(t) and s(t — 1/ W). Consequently, the output of the correlators may be 
expressed as 


rr=cjVé,cosmn+n, m=0,1, 
r=c2/E,cosmin+n2, m=0,1, 


where the noise terms nı and nz are uncorrelated complex-valued Gaussian random 
variables with zero mean and identical variance o? = 1. Suppose that the receiver 
has perfect estimates of cı and c2 and computes the decision variable at the input to 
the detector as 


R=Re [efri + c3r2] 
= +,/8, [Ici l? + |ca|?] cosmn + Re [cfm + cžm] 
= (a; + «3)./8,cosmn +n, m=0,1. 
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Therefore, the Monte Carlo simulation may be performed by generating the deci- 
sion variable R at the input to the detector, with a and a2 being two statisti- 
cally independent and identically distributed Rayleigh random variables and n being 
a real-valued zero-mean Gaussian random variable with unit variance. Note that 
the {c;} are independently selected in each symbol transmitted. Compare the esti- 
mated error probability from the Monte Carlo simulation with the theoretical (large 
SNR) results given by Equation (14.3.32). Perform the simulation for N = 100,000 
sample. 


Generation of Channel Matrix for MIMO Systems 


Write the MATLAB code for generating the channel matrix C of a MIMO system 
that employs Nr transmit and Np receive antennas, where the channel is frequency 
nonselective and slowly fading, and elements of C are zero-mean, complex-valued 
Gaussian with identical variances equal to unity. 


Generation of the Channel Matrix and Inputs to the Detector 


Suppose Nr = Nr = 2. Generate the elements of the channel matrix C for a 
Rayleigh fading AWGN channel and the corresponding inputs to the detectors at 
the two receive antennas. 


Implementation of MIMO Detectors 
Implement the three types of detectors described in Section 14.4.3 in MATLAB. 


Simulation of an Nr = 2, Nr = 1 MISO System with Alamouti Code 


Perform a Monte Carlo simulation to estimate the error rate performance of an 
Nr = 2, Nr = 1 multiple antenna system that employs the Alamouti STBC. 
Hence, generate the input to the detector as given in Equation (14.4.25), where the 
signal points are selected from a QPSK constellation; ci; and c12 are statistically 
independent, complex-valued, zero-mean Gaussian random variables with unit vari- 
ance; and 7; and 72 are also statistically independent, complex-valued, zero-mean 
Gaussian random variables with variance o?. The detector computes the estimates 
as in Equation (14.4.26) and decides on which symbols are closest to $4 and $, in 
Euclidean distance. Perform the above computations for N = 10,000 iterations for 
any given value of o*, where in each iteration the channel coefficients (c11, c12), the 
signal points (s1, 52), and, additive noise terms (71, 72) are selected independently. 
Plot the measured symbol error rate as a function of the SNR = 10 log,(€,/207), 
where €, = €;/2 is the energy per bit, which may be normalized to unity for 
convenience. 


Simulation of MIMO Systems 


Suppose we add a second receive antenna to the system in Computer Problem 14.9 
that employs the Alamouti code. The channel coefficients for the received signal at 
the second antenna are c2; and c22. Simulate the (Nr, Nr) = (2, 2) MIMO systems 
in a Rayleigh fading AWGN channel and plot the error probability as a function 
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of the SNR/bit. Thus, demonstrate that this MIMO system achieves an order of 
diversity of NrNr = 4. 


14.11 Simulation of Space-Time Trellis Code 


Write the MATLAB code for implementing the 4-PSK, four-state trellis encoder 
shown in Figure 14.23. 
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In our treatment of signal design for digital communication over an additive white Gaussian 
noise (AWGN) channel, the major objective has been the efficient utilization of transmit- 
ter power and channel bandwidth. As shown in Chapter 13, channel coding allows us to 
reduce the transmitter power by increasing the transmitted signal bandwidth through code 
redundancy; this allows us to trade off transmitter power with channel bandwidth. To be 
specific, let R denote the information (bit) rate at the input to the transmitter and let W 
denote the channel bandwidth. We also define the ratio W/R as the bandwidth expansion 
factor, denoted as Be, of the channel coded transmitted signal. The factor Be represents 
the amount of redundancy introduced through channel coding. Thus, by increasing B,, we 
can reduce the power in the transmitted signal that is required to achieve a specific level of 
performance. In most practical communication systems, B, is in the range of 2 < Be <5. 
This is the basic methodology for the design of digital communication systems for AWGN 
channels. 

In practice, other factors influence the design of an efficient digital communication 
system. For example, in multiple-access communication when two or more transmitters 
use the same common channel to transmit information, the interference created by the 
users of the channel limits the performance achieved by the system. The existence of such 
interference must be considered in the design of a reliable digital communication system. 

Even in this complex design problem, the basic system design parameters are trans- 
mitter power and channel bandwidth. To overcome the degradation in performance caused 
by interference, we may further increase the bandwidth of the transmitted signal so that the 
bandwidth expansion factor Be = W/R is much greater than unity. This is one character- 
istic of a spread-spectrum signal. A second important characteristic is that the information 
signal at the modulator is spread in bandwidth by means of a code that is independent of the 
information sequence. This code has the property of being pseudorandom, i.e., it appears 
random to receivers other than the intended receiver, which uses the knowledge of the 
code to demodulate the signal. This second characteristic distinguishes a spread-spectrum 
communication system from the conventional communication system that expands the 
transmitted signal bandwidth by means of channel code redundancy, as described in 
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Chapter 13. However, channel coding is an important element in the design of an efficient 
spread-spectrum communication system. 

Spread-spectrum signals for digital communications were originally developed and 
used for military communications either (a) to provide resistance to jamming (antijam pro- 
tection), or (b) to hide the signal by transmitting it at low power, which made it difficult 
for an unintended listener to detect its presence in noise (low probability of intercept). 
However, spread-spectrum signals now provide reliable communications in a variety of 
commercial applications, including digital cellular communications, cordless telephones, 
and interoffice wireless communications. 

In this chapter, we present the basic characteristics of spread-spectrum signals and 
assess their performance in terms of probability of error. We concentrate our discussion on 
two methods for spreading the signal bandwidth, namely, by direct sequence modulation 
and by frequency hopping. Both methods require the use of pseudorandom code sequences 
whose generation is also described. Several applications of spread-spectrum signals are 
presented. 


15.1 MODEL OF A SPREAD-SPECTRUM DIGITAL COMMUNICATION SYSTEM 


The basic elements of a spread-spectrum digital communication system are illustrated 
in Figure 15.1. The channel encoder and decoder and the modulator and demodulator 
are the basic elements of a conventional digital communication system. In addition, a 
spread-spectrum system employs two identical pseudorandom sequence generators, one 
that interfaces with the modulator at the transmitting end and one that interfaces with 
the demodulator at the receiving end. These two generators produce a pseudorandom or 
pseudonoise (PN) binary-valued sequence, which is used to spread the transmitted signal 
at the modulator and to despread the received signal at the demodulator. 

Time synchronization of the PN sequence generated at the receiver with the PN 
sequence contained in the received signal is required to properly despread the received 
spread-spectrum signal. In a practical system, synchronization is established prior to the 
transmission of information; this is achieved by transmitting a fixed PN bit pattern, which 
is designed so that the receiver will detect it with high probability in the presence of inter- 
ference. After time synchronization of the PN sequence generators is established, the trans- 
mission of information commences. In the data mode, the communication system usually 
tracks the timing of the incoming received signal and keeps the PN sequence generator in 


synchronism. 
Information Output 
sequence data 
Channel Modulator Channel Demodulator Channel 
encoder | decoder | 


=i = 


pattern 
generator 


pattern 
generator 





Figure 15.1 Model of a spread-spectrum digital communication system. 
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Interference is introduced in the transmission of the spread-spectrum signal through 
the channel. The characteristics of the interference depend to a large extent on its origin. 
The interference may be generally categorized as either broadband or narrowband (partial 
band) relative to the bandwidth of the information-bearing signal and either continuous in 
time or discontinuous (pulsed) in time. For example, an interfering signal may consist of 
a high-power sinusoid in the bandwidth occupied by the information-bearing signal. Such 
a signal is narrowband. As a second example, the interference generated by other users 
in a multiple-access channel depends on the type of spread-spectrum signals that are used 
to transmit information. If all users employ broadband signals, the interference may be 
characterized as an equivalent broadband noise. If the users employ frequency hopping to 
generate spread-spectrum signals, the interference from other users may be characterized 
as narrowband. 

Our discussion will focus on the performance of spread-spectrum signals for digi- 
tal communication in the presence of narrowband and broadband interference. Two types 
of digital modulation are considered, namely, phase-shift keying (PSK) and frequency- 
shift keying (FSK). PSK modulation is appropriate for applications where phase coher- 
ence between the transmitted signal and the received signal can be maintained over a time 
interval that spans several symbol (or bit) intervals. On the other hand, FSK modulation is 
appropriate in applications where phase coherence of the carrier cannot be maintained due 
to time variations in the transmission characteristics of the communications channel. For 
example, this may be the case in a communications link between two high-speed aircraft 
or between a high-speed aircraft and a ground-based terminal. 

The PN sequence generated at the modulator is used in conjunction with the PSK 
modulation to shift the phase of the PSK signal pseudorandomly at a rate that is an integer 
multiple of the bit rate. The resulting modulated signal is called a direct sequence (DS) 
spread-spectrum signal. When used in conjunction with binary or M-ary (M > 2) FSK, 
the PN sequence is used to select the frequency of the transmitted signal pseudorandomly. 
The resulting signal is called a frequency-hopped (FH) spread-spectrum signal. Although 
other types of spread-spectrum signals can be generated, our treatment will emphasize DS 
and FH spread-spectrum communication systems, which are generally used in practice. 


15.2 DIRECT SEQUENCE SPREAD-SPECTRUM SYSTEMS 


Consider the transmission of a binary information sequence by means of binary PSK. The 
information rate is R bits per second, and the bit interval is T, = 1/R seconds. The avail- 
able channel bandwidth is B, Hz, where B, >> R. At the modulator, the bandwidth of the 
information signal is expanded to W = B, Hz by shifting the phase of the carrier pseudo- 
randomly at arate of W times per second according to the pattern of the PN generator. The 
basic method for accomplishing the spreading is shown in Figure 15.2. 

The information-bearing baseband signal is denoted as v(t) and is expressed as 


foe} 


vt) = J. angr(t — nū), (15.2.1) 


n=—oo 
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c(t) } 











-1 
(a) PN signal 
v(t) 
(b) Data signal 
v(t) c(t) 
+1 





ae 


(c) Product signal 


Figure 15.2 Generation of a DS spread-spectrum signal. 


where {a, = +1, —co < n < oo} and g7(ft) is a rectangular pulse of duration 7}. This sig- 
nal is multiplied by the signal from the PN sequence generator, which may be expressed as 
foe} 
c(t)= ` caplt—nT;), (15.2.2) 
n=—00 P 
where {c„} represents the binary PN code sequence of +1’s and p(t) is a rectangular pulse 
of duration T,, as illustrated in Figure 15.2. This multiplication operation serves to spread 
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Figure 15.3 Convolution of the spectra of the (a) data signal with the (b) PN code signal. 


the bandwidth of the information-bearing signal (whose bandwidth is approximately R 
Hz) into the wider bandwidth occupied by PN generator signal c(t) (whose bandwidth 
is approximately 1/T,). The spectrum spreading is illustrated in Figure 15.3, which uses 
simple rectangular spectra to show the convolution of the two spectra, the narrow spectrum 
corresponding to the information-bearing signal and the wide spectrum corresponding to 
the signal from the PN generator. 

The product signal v(t)c(t) amplitude modulates the carrier A, cos 2x ft and gen- 
erates the double-sideband suppressed-carrier (DSB-SC) signal 


u(t) = A-v(t)e(t) cos2n fet. (15.2.3) 


Since v(t)c(t) = +1 for any ż, it follows that the carrier modulated transmitted signal may 
also be expressed as 


u(t) = A, cos[2n fet + O(t)], (15.2.4) 
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where @(t) = 0 when v(t)c(t) = 1 and 6(t) = x when v(t)c(t) = —1. Therefore, the 
transmitted signal is a binary PSK signal. 

The rectangular pulse p(t) is usually called a chip, and its time duration Te is called 
the chip interval. The reciprocal 1/T, is called the chip rate and corresponds (approxi- 
mately) to the bandwidth W of the transmitted signal. In practical spread-spectrum sys- 
tems, the ratio of the bit interval T, to the chip interval Te is usually selected to be an 
integer. We denote this ratio as 


AR (15.2.5) 


T, 
Te 
Hence, L, is the number of chips of the PN code sequence per information bit. Another 
interpretation is that Le represents the number of possible 180° phase transitions in the 
transmitted signal during the bit interval T. 

The demodulation of the signal is illustrated in Figure 15.4. The received signal is 
first multiplied by a replica of the waveform c(t) generated by the PN code sequence 
generator at the receiver, which is synchronized to the PN code in the received signal. This 
operation is called (spectrum) despreading, since the effect of multiplication by c(t) at the 
receiver is to undo the spreading operation at the transmitter. Thus, we have 


A-v(t)c?(t) cos 2n fot = A-v(t) cos 27 fot, (15.2.6) 


since c?(t) = 1 for all t. The resulting signal Acv(t) cos 27 fet occupies a bandwidth 
(approximately) of R Hz, which is the bandwidth of the information-bearing signal. There- 
fore, the demodulator for the despread signal is simply the conventional cross correlator or 
matched filter that was described in Chapter 8. Since the demodulator has a bandwidth that 
is identical to the bandwidth of the despread signal, the only additive noise that corrupts 
the signal at the demodulator is the noise that falls within the information bandwidth of the 
received signal. 


15.2.1 Effect of Despreading on a Narrowband Interference 


It is interesting to investigate the effect of an interfering signal on the demodulation of the 
desired information-bearing signal. Suppose that the received signal is 


r(t) = ADE cos 2x f-t + i(t), (15.2.7) 
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where i(t) denotes the interference. The despreading operation at the receiver yields 
r(t)c(t) = Acv(t) cos 27 fet + i(t)c(t). (15.2.8) 


The effect of multiplying the interference i(t) with c(t) is to spread the bandwidth of i(t) 
to W Hz. 
As an example, consider the sinusoidal interfering signal 


i(t) = Arcos 27 fit, (15.2.9) 


where f7 is a frequency within the bandwidth of the transmitted signal. Its multiplication 
with c(t) results in a wideband interference with power spectral density lọ = P/W, 
where P; = A? /2 is the average power of the interference. Since the desired signal is 
demodulated by a matched filter (or correlator) that has a bandwidth R, the total power in 
the interference at the output of the demodulator is 


P P P 
bR, = P R/W = — = — = 2, (15.2.10) 
W/R To/Te Le 


Therefore, the power in the interfering signal is reduced by an amount equal to the band- 
width expansion factor W/R. The factor W/R = T,/Te = Le is called the processing 
gain of the spread-spectrum system. The reduction in interference power is the basic rea- 
son for using spread-spectrum signals to transmit digital information over channels with 
interference. 

In summary, the PN code sequence is used at the transmitter to spread the information- 
bearing signal into a wide bandwidth for transmission over the channel. By multiplying 
the received signal with a synchronized replica of the PN code signal, the desired signal 
is despread back to a narrow bandwidth while any interference signals are spread over a 
wide bandwidth. The net effect is a reduction in the interference power by the factor W/R, 
which is the processing gain of the spread-spectrum system. 

The PN code sequence {cn} is assumed to be known only to the intended receiver. 
Any other receiver that does not have knowledge of the PN code sequence cannot demod- 
ulate the signal. Consequently, the use of a PN code sequence provides a degree of privacy 
(or security) that is not possible with conventional modulation. The primary cost for this 
security and performance gain against interference is an increase in channel bandwidth 
utilization and in the complexity of the communication system. 


15.2.2 Probability of Error at the Detector 
To derive the probability of error for a direct sequence spread-spectrum system, we assume 
that the information is transmitted via binary PSK. Within the bit interval 0 < t < Tp, the 


transmitted signal is 


S(t) = aogr(t)c(t) cos2nfct, O<t < Th, (15.2.11) 
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where a, = +1 is the information symbol, the pulse g7(t) is defined as 


\ ie, 0<t<T 
gr(t) = ; (15.2.12) 


0, otherwise 
and c(t) is the output of the PN code generator which, over a bit interval, is expressed as 


Le—l 


c(t) = Ý. cap(t — nT), (15.2.13) 
n=0 


where L, is the number of chips per bit, T, is the chip interval, and {c} denotes the PN 
code sequence. The code chip sequence {cn} is uncorrelated (white), i.e., 


ElenCm] = Elcn]E[cm], for nm, (15.2.14) 


and each chip is +1 or —1 with equal probability. These conditions imply that E[c,] = 0 
and E [c?] = 1. 

The received signal is assumed to be corrupted by an additive interfering signal i(t). 
Hence, 


r(t) = aogr(t — ta)c(t — tg) cosQu fet + 6) +i(t), (15.2.15) 


where ty represents the propagation delay through the channel and ¢ represents the carrier 
phase shift. Since the received signal r (t) is the output of an ideal bandpass filter in the front 
end of the receiver, the interference i(t) is also a bandpass signal and may be represented as 


i(t) = ie(t)cos 2v fet — is(t) sin 27 fet, (15.2.16) 


where ic(t) and is(t) are the two quadrature components. 

Assuming that the receiver is perfectly synchronized to the received signal, we may 
set t = 0 for convenience. In addition, the carrier phase is assumed to be perfectly 
estimated by a phase-locked loop (PLL). Then, the signal r(t) is demodulated by first 
despreading through multiplication by c(t) and then through cross correlating with 
gr(t)cos(2n fet + $), as shown in Figure 15.5. At the sampling instant t = Foyt the output 
of the correlator is 

Y(T) = Sy + yi (To), (052.17) 


where y; (Tp) represents the interference component, which has the form 


Th 
yi(Tp) = f c(t)i(t)gr(t) cos(2x fet + ġ)dt 


Lc—1 


= a Cn T p(t —nT-)i(t)gr(t) cos(2n fet + )dt 


a D CnVn, (15.2.18) 
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where, by definition, 
(nt+))Te 
Vn = f i(t) cos(2n fet + p)dt. (15.2.19) 
nTe 


The probability of error depends on the statistical characteristics of the interference 
component. Clearly, its mean value is 


E[y;(T)] = 0. (15.2.20) 
Its variance is 


Le-1 Le-1 


28 
Ely? (T) = 7 XO YE ElencmlE aml: 


n=0 m=0 
But E[cpCm] = Sinan. Therefore, 


re 
Ely?(Tp)] = ae > Elva 
n=0 


7 
= 2607 EDW, (15.2.21) 
Ty 


where v = vn, as given by Equation (15.2.19). To determine the variance of v, we must 
postulate the form of the interference. 

First, we will assume that the interference is sinusoidal. Specifically, we assume that 
the interference is at the carrier frequency and has the form 


i(t) = V2P; cos(2nf-t + ©,), (15.2.22) 
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where Py is the average power and ©, is the phase of the interference, which we assume 
to be random and uniformly distributed over the interval (0, 21). If we substitute for i(t) 
in Equation (15.2.19), we obtain 


(n+1)Te 


Vn = V2P; cos(2nf-t + Or) cos(2n fet + )dt 
nTe 


(n+1)Te 


= = VP fi cos(@; — ¢)dt = fe J2P; cos(@; — ¢). (15.2.23) 


Since ©; is a random variable, v, is also random. Its mean value is zero, i.e., 











T, on] 
E[va] = z v2P Í = cos(@; — ¢)d@; = 0. (15.2.24) 
0 
Its mean square value is 
OTP 
Elv; cos? (©; — ¢)dO1 
a 
= np (15.2.25) 
4 
We may now substitute for E [v2] into Equation (15.2.21). Thus, we obtain 
Eb PIT, 
E[y?(1)] = — 7 (15.2.26) 


The ratio of {E [y(Tp)]} to E Ly (Tp) ] i is the signal-to-noise ratio (SNR) at the detector. In 
this case, we have 5 
E 26b 
SNR)p = = —4—— = 2, 15.2.27 
ONE €yPiTe/2 Pi Te ( ) 
To see the effect of the spread-spectrum signal, we express the transmitted energy p as 
e = Ph, (15.2.28) 


where Ps is the average signal power. Then, if we substitute for €, in Equation (15.2.27), 
we obtain 








2PsT, 2Ps 
SNR)p = = ; 152.29 
(SNR)p PT. P/L: ( ) 
where Le = T,/T; is the processing gain. Therefore, the spread-spectrum signal has 


reduced the power of the interference by the factor Le. 

Another interpretation of the effect of the spread-spectrum signal on the sinusoidal 
interference is obtained if we express P;T; in Equation (15.2.29) as a power spectral den- 
sity. Since T, ~ 1/ W, we have 


PT: = P/W =h, (15.2.30) 


where Io is the power spectral density of an equivalent interference in a bandwidth W. 
In effect, the spread-spectrum signal has spread the sinusoidal interference over the wide 
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bandwidth W, creating an equivalent spectrally flat noise with the power spectral density 
Io. Hence, 


28 
(SNR)p = T (15.2.31) 
0 


The probability of error for a direct sequence spread-spectrum system with binary 
PSK modulation is easily obtained from the SNR at the detector, if we make an assumption 
on the probability distribution of the sample y;(T,). From Equation (15.2.18), we note that 
yi (Tp) consists of a sum of Le uncorrelated random variables {c,v,,0 < n < Le — 1}, all 
of which are identically distributed. Since the processing gain Le is usually large in any 
practical system, we may use the central limit theorem to justify a Gaussian probability 
distribution for y;(T,). Under this assumption, the probability of error for the sinusoidal 
interference is 


P=0| 1], (15.2.32) 


where To is the power spectral density of an equivalent broadband interference. A similar 
expression for the error probability is obtained when the interference i(t) is a zero-mean 
broadband random process with a constant power spectral density Jo over the bandwidth 
W of the spread-spectrum signal. 


Example 15.2.1 
The SNR required at the detector to achieve reliable communication in a DS spread-spectrum 
communication system is 13 dB. If the interference-to-signal power at the receiver is 20 dB, 
determine the processing gain required to achieve reliable communication. 


Solution We are given (P;/Ps)ag = 20 dB or, equivalently, P;/Ps = 100. We are also 
given (SNR)p = 13 dB, or equivalently, (SNR)p = 20. The relation in Equation (15.2.29) 
may be used to solve for Le. Thus, 


1/Pr 
L: = = {| — }| (SNR)p = 1000. 
5 ( =) (SNR)p 


Therefore, the processing gain required is 1000 or, equivalently, 30 dB. a 


The Interference Margin. We may express a in the Q-function in Equation 
(15.2.32) as 
p _ PsTeo _ Ps/R _ W/R 
lo Pı/W  Pı/W  Pr/ Ps’ 











(15.2.33) 


Also, suppose we specify a required €,/Io to achieve a desired level of performance. Then, 
using a logarithmic scale, we may express Equation (15.2.33) as 


Pi W Eb 
10 log— = 10 log— — 10 1 — 
ae PER oe (=) 


(=) -(=) - (3) l (15.2.34) 
Ps dB R dB Io dB 
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The ratio (P;/Ps)ap is called the interference margin. This is the relative power advantage 
that an interference may have without disrupting the communication system. 


Example 15.2.2 


Suppose we require an (,/Io)aa = 10 dB to achieve reliable communication. What is the 
processing gain that is necessary to provide an interference margin of 20 dB? 


Solution Clearly, if W/R = 1000, then (W/R)ag = 30 dB and the interference margin is 
(P;/Ps)ag3 = 20 dB. This means that the average interference power at the receiver may be 
100 times the power Ps of the desired signal and we can still maintain reliable 
communication. a 


15.2.3 Performance of Coded Spread-Spectrum Signals 


As shown in Chapter 13, when the transmitted information is coded by a binary linear 
(block or convolutional) code, the SNR at the output of a soft-decision decoder, at large 
SNR, is increased by the coding gain, defined as 


coding gain = R.d#._, (15.2.35) 


where R, is the code rate and di is the minimum Hamming distance of the code. There- 
fore, the effect of coding is to increase the interference margin by the coding gain. Thus, 
Equation (15.2.34) may be modified as 


Pi W 7) 
P\ _(W ETAN A ; 15.2.36 
ea ORL ii (5; dB ; i 


where (C G)ag denotes the coding gain in dB. 


15.3 SOME APPLICATIONS OF DS SPREAD-SPECTRUM SIGNALS 


In this section, we briefly describe the use of DS spread-spectrum signals in four appli- 
cations. First, we consider an application in which the signal is transmitted at very low 
power, so that a listener trying to detect the presence of the signal would encounter great 
difficulty in doing so. A second application is multiple access radio communications. A 
third application involves the use of a DS spread-spectrum signal to resolve the multipath 
in a time-dispersive radio channel. The fourth application is the use of DS spread-spectrum 
signals in wireless local area networks (LANS). 


15.3.1 Low-Detectability Signal Transmission 


In this application, the information-bearing signal is transmitted at a very low power level 
relative to the background channel noise and thermal noise that is generated in the front 
end of areceiver. If the DS spread-spectrum signal occupies a bandwidth W and the power 
spectral density of the additive noise is No W/Hz, the average noise power in the bandwidth 
W is P, N = WNo. 
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The average received signal power at the intended receiver is Pr. If we wish to hide 
the presence of the signal from receivers that are in the vicinity of the intended receiver, 
the signal is transmitted at a power level such that Pr/ Pn <& 1. The intended receiver can 
recover the weak information-bearing signal from the background noise with the aid of the 
processing gain and the coding gain. However, any other receiver that has no knowledge 
of the PN code sequence is unable to take advantage of the processing gain and the coding 
gain. Consequently, the presence of the information-bearing signal is difficult to detect. We 
say that the transmitted signal has a low probability of being intercepted (LPI), and it is 
called an LPI signal. 

The probability of error given in Section 15.2.2 also applies to the demodulation and 
decoding of LPI signals at the intended receiver. 


Example 15.3.1 


A DS spread-spectrum signal is designed so that the power ratio Pr/Py at the intended 
receiver is 10~?. If the desired €,/No = 10 for acceptable performance, determine the mini- 
mum value of the processing gain. 


Solution We may write 6,/No as 


Eo _ PrTo _ PrLcTe f Pr L.= Pr L 
No No No \WNo/) ° \Py) © 





Since ,/No = 10 and Pr/Py = 10~?, it follows that the necessary processing gain is 
Le = 1000. a 


15.3.2 Code Division Multiple Access 


The enhancement in performance obtained from a DS spread-spectrum signal through 
the processing gain and the coding gain can enable many DS spread-spectrum signals to 
occupy the same channel bandwidth provided that each signal has its own pseudorandom 
(signature) sequence. Thus, it is possible to have several users transmit messages simulta- 
neously over the same channel bandwidth. This type of digital communication, in which 
each transmitter—receiver user pair has its own distinct signature code for transmitting over 
a common channel bandwidth, is called code division multiple access (CDMA). 

In the demodulation of each DS spread-spectrum signal, the signals from the other 
simultaneous users of the channel appear as additive interference. The level of interference 
varies as a function of the number of users of the channel at any given time. A major advan- 
tage of CDMA is that a large number of users can be accommodated if each user transmits 
messages for a short period of time. In such a multiple access system, it is relatively easy 
to add new users or to decrease the number of users without reconfiguring the system. 

Next, we determine the number of simultaneous signals that can be accommodated 
ina CDMA system. For simplicity, we assume that all signals have identical average pow- 
ers. In many practical systems, the received signal power level from each user is monitored 
at a central station and power control is exercised over all simultaneous users via a con- 
trol channel that instructs the users on whether to increase or decrease their power level. 
With such power control, if there are N,, simultaneous users, the desired signal-to-noise 
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interference power ratio at a given receiver is 


Ps Ps 1 


— = ——_ = ~. 15.3.1 
Py (Nu-—1)Ps Ny-1 ( ) 


From this relation, we can determine the number of users that can be accommodated simul- 
taneously. Example 15.3.2 illuswates the computation. 


Example 15.3.2 
Suppose that the desired level of performance for a user in a CDMA system is an error proba- 
bility of 1076, which is achieved when 6, /J9 = 20 (13 dB). Determine the maximum number 
of simultaneous users that can be accommodated in a CDMA system if the bandwidth-to-bit- 
rate ratio is 1000 and the coding gain is R.d, = 4 (6 dB). 


Solution Fromthe relationships given in Equations (15.2.36) and (15.3.1), we have 





Ep W/R 
lb N,-1 © ™ 
Ifwe solve for N,, we obtain 
W/R 
N, = —— R,d” +1 
20 min F 
For W/R = 1000 and R,d#,, = 4, we obtain the result that N, = 201. a 


In determining the maximum number of simultaneous users of the channel, we 
implicitly assumed that the pseudorandom code sequences used by the various users are 
uncorrelated and that the interference from other users adds on a power basis only. How- 
ever, orthogonality of the pseudorandom sequences among N,„ users is generally difficult 
to achieve, especially if N,, is large. In fact, the design of a large set of pseudorandom 
sequences with good correlation properties is an important problem that has received 
considerable attention in the technical literature. We shall briefly treat this problem in 
Section 15.4. 

CDMA is a viable method for providing digital cellular telephone service to mobile 
users. In Section 15.7.2, we describe the basic characteristics of the North American digital 
cellular system that employs CDMA. 


15.3.3 Communication over Channels with Multipath 


In Chapter 14, we described the characteristics of fading multipath channels and the design 
of signals for effective communication through such channels. One example of a fad- 
ing multipath channel is ionospheric propagation in the HF frequency band (3-30 MHz), 
where the ionospheric layers serve as signal reflectors. Another example occurs in mobile 
radio communication systems, where the multipath propagation is due to reflection from 
buildings, trees, and other obstacles located between the transmitter and the receiver. 

Our discussion on signal design in Chapter 14 focused on frequency selective chan- 
nels, where the signal bandwidth W is larger than the coherence bandwidth Bop of the 
channel. If W > Bop, we may consider two approaches to signal design. One approach is 
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to use orthogonal frequency-division multiplexing (OFDM). Thus, we subdivide the avail- 
able bandwidth W into N subchannels such that the bandwidth per channel 4 < Boy. In 
this way, each subchannel is frequency non-selective and the signals in each subchannel 
satisfy the condition that the symbol interval T >> Tm, where Tm is the multipath spread of 
the channel. Thus, intersymbol interference is avoided. A second approach is to design the 
signal to utilize the entire signal bandwidth W and transmit it on a single carrier. In this 
case, the channel is frequency selective and the multipath components with differential 
delays of $ or greater become resolvable. 

DS spread spectrum is a particularly effective way to generate a wideband signal 
for resolving multipath signal components. By separating the multipath components, we 
may also reduce the effects of fading. For example, in line-of-sight (LOS) communica- 
tion systems where there is a direct path and a secondary propagation path resulting from 
signals reflecting from buildings and surrounding terrain, the demodulator at the receiver 
may synchronize to the direct signal component and ignore the existence of the multi- 
path component. In such a case, the multipath component becomes a form of interference 
{intersymbol interference (ISD] on the demodulation of subsequent transmitted signals. 

ISI can be avoided if we are willing to reduce the symbol rate 4 such that T > Tn. 
In this case, we employ a DS spread-spectrum signal with a bandwidth W to resolve the 
multipath. Thus, the channel is frequency selective and the appropriate channel model 
is the tapped-delay-line model with time-varying coefficients, as shown in Figure 14.6. 
The optimum demodulator for this channel is a filter matched to the tapped-delay channel 
model called the RAKE demodulator, as described in Section 14.3.3. 


15.3.4 Wireless LANs 


Spread-spectrum signals have been used in the IEEE wireless LAN standards 802.11 and 
802.11b, which operate in the 2.4 GHz ISM (industrial, scientific, and medical) unlicensed 
frequency band. The available bandwidth is subdivided into 14 overlapping 22 MHz chan- 
nels, although not all channels are used in all countries. In the United States, only channels 
1 through 11 are used. 

In the 802.11 standard, an 11-chip Barker sequence is modulated and transmitted at 
a chip rate of 11 MHz, i.e., the chip duration is 0.909 usec. The 11-chip Barker sequence 
is {1, —1, 1,1, —1, 1, 1, 1, —1, —1, —1}. As described in the next section, this sequence is 
desirable because its autocorrelation has sidelobes of less than or equal to 1, compared with 
the peak autocorrelation value of 11. The Barker sequence is modulated either with BPSK 
or QPSK. When BPSK is used with 11 chips per bit, a data rate of 1 Mbps is achieved. 
When QPSK modulation is used with 11 chips per symbol (2 bits), a data rate of 2 Mbps 
is achieved. 

Direct sequence spread spectrum is also used in the higher speed (second generation) 
IEEE 802.11b wireless LAN standard, which operates in the same 2.4 GHz ISM band. In 
802.11b, the 11-MHz chip rate is maintained, but the Barker sequence is replaced by a 
set of 8-chip waveform sequences, called complementary code shift keying (CCK), which 
can be viewed as direct-sequence spread-spectrum modulation with multiple spreading 
sequences. The use of CCK modulation results in a data rate of 11 Mbps. 
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15.4 GENERATION OF PN SEQUENCES 


A pseudorandom or pseudonoise sequence is a code sequence of 1’s and 0’s whose autocor- 
relation has properties similar to those of white noise. In this section, we briefly describe 
the construction of some PN sequences and their autocorrelation and cross-correlation 
properties. 

The most widely known binary PN code sequences are the maximum-length shift- 
register sequences. A maximum-length shift-register sequence, or m-sequence for short, 
has the length L = 2” — 1 bits and is generated by an m-stage shift register with lin- 
ear feedback, as illustrated in Figure 15.6. The sequence is periodic with period L. Each 
period contains 2”7! ones and 2”"—! — 1 zeros. Table 15.1 lists shift register connections 
for generating maximum-length sequences. 

In DS spread-spectrum applications, the binary sequence with elements {0, 1} is 
mapped into a corresponding binary sequence with elements {—1, 1}. We shall call the 
equivalent sequence {c,} with elements {—1, 1} a bipolar sequence. 





— m stages 








Figure 15.6 General m-stage shift register with linear feedback. 


TABLE 15.1 SHIFT-REGISTER CONNECTIONS FOR GENERATING MAXIMUM-LENGTH SEQUENCES 


Stages Connected Stages Connected Stages Connected 
m to Modulo-2-Adder m to Modulo-2-Adder m to Modulo-2 Adder 
2 1,2 13 1, 10, 11, 13 24 1, 18, 23, 24 
3 1,3 14 1, 5, 9, 14 25 1, 23 
4 1,4 15 1,15 26 1, 21, 25, 26 
5 1,4 16 1,5, 14, 16 27 1, 23, 26, 27 
6 1,6 17 1,15 28 1, 26 
7 1,7 18 1,12 29 1, 28 
8 1,5, 6,7 19 1,15, 18,19 30 1, 8, 29, 30 
9 1,6 20 1, 18 31 1,29 
10 1,8 21 1, 20 32 1,11, 31, 32 
11 1,10 22 1,22 33 1, 21 
12 1,7, 9, 12 23 1,19 34 1, 8, 33, 34 
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An important characteristic of a periodic PN.sequence is its autocorrelation function, 
which is usually defined in terms of the bipolar sequences {cn} as 


L 
Rm] =Y CrCnim, OSm<L—1, (15.4.1) 


n=1 


where L is the period of the sequence. Since the sequence {c,} is periodic with period L, 
the autocorrelation sequence {R,[m]} is also periodic with period L. 

Ideally, a PN sequence should have an autocorrelation function that has correlation 
properties similar to white noise. That is, the ideal autocorrelation sequence for {cp} is 
R[0] = L and Rm] = 0 for 1 < m < L — 1. In the case of m-sequences, the autocorre- 
lation sequence is 


L, m=0 
R.[m] = i (15.4.2) 
—-1,l<m<L-l1 


For long m-sequences, the size of the off-peak values of R,[m] relative to the peak value 
R,{0], i.e., the ratio R-[m]/R-[0] = —1/L, is small and, from a practical viewpoint, incon- 
sequential. Therefore, m-sequences are very close to ideal PN sequences when viewed in 
terms of their autocorrelation function. 

In some applications, the cross correlation properties of PN sequences are as impor- 
tant as the autocorrelation properties. For example, in CDMA, each user is assigned a 
particular PN sequence. Ideally, the PN sequences among users should be mutually uncor- 
related so that the level of interference experienced by one user from the transmissions 
of other users adds on a power basis. However, in practice, the PN sequences of different 
users exhibit some correlation. 

To be specific, consider the class of m-sequences. We know that the periodic cross 
correlation function between a pair of m-sequences of the same period can have relatively 
large peaks. Table 15.2 lists the peak magnitude Rmax for the periodic cross correlation 
between pairs of m-sequences for 3 < m < 12. It also lists the number of m-sequences 
of length L = 2” — 1 for3 < m < 12. We observe that the number of m-sequences 
of length L increases rapidly with m. We also observe that, for most sequences, the peak 
magnitude Rmax of the cross-correlation function is a large percentage of the peak value of 
the autocorrelation function. Consequently, m-sequences are not suitable for CDMA com- 
munication systems. Although it is possible to select a small subset of m-sequences that 
have a relatively lower cross-correlation peak value than Rmax, the number of sequences in 
the set is usually too small for CDMA applications. 

Methods for generating PN sequences with better periodic cross-correlation proper- 
ties than m-sequences have been developed by Gold (1967, 1968) and by Kasami (1966). 
Gold sequences are constructed by taking a pair of specially selected m-sequences, called 
preferred m-sequences, and forming the modulo-2 sum of the two sequences for each of 
L cyclicly shifted versions of one sequence relative to the other sequence. Thus, L Gold 
sequences are generated as illustrated in Figure 15.7. For m odd, the maximum value of 
the cross-correlation function between any pair of Gold sequences is Rmax = /2L. For m 
even, Rmax = VL. 
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TABLE 15.2 PEAK CROSS CORRELATIONS OF M-SEQUENCES AND GOLD SEQUENCES 





m Sequences 


Peak Cross Gold Sequences 
m L=2™'! Number Correlation Rmax Rmmax/ R(O) Rmax Rmin/ R[OJ 
3 7 2 5 0.71 5 0.71 
4 15 2 9 0.60 9 0.60 
5 31 6 11 0.35 9 0.29 
6 63 6 23 0.36 17 0.27 
7 127 18 41 0.32 17 0.13 
8 255 16 95 0.37 33 0.13 
9 511 48 113 0.22 33 0.06 
10 1023 60 383 0.37 65 0.06 
11 2047 176 287 0.14 65 0.03 
12 4095 144 1407 0.34 129 0.03 





Figure 15.7 Generation of Gold sequences of length 31. 


Kasami (1966) described a method for constructing PN sequences by decimating an 
m-sequence. In Kasami’s method of construction, every 2™/2 + 1 bit of an m-sequence is 
selected. This method of construction yields a smaller set of PN sequences compared with 
Gold sequences, but their maximum cross-correlation value is Rmax = VL. . 

It is interesting to compare the peak value of the cross-correlation function for Gold 
sequences and for Kasami sequences with a known lower bound for the maximum cross 
correlation between any pair of binary sequences of length L. Given a set of N sequences 
of period L, a lower bound on their maximum cross correlation is 


N-1 
Rox = L,{——., 15.4.3 


which, for large values of L and N, is well approximated as Rmax > JVL . Hence, Kasami 
sequences satisfy the lower bound, and they are optimal. On the other hand, Gold sequences 
with m odd have an Rmax = V 2L. Hence, they are slightly suboptimal. 
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15.5 FREQUENCY-HOPPED SPREAD SPECTRUM 


In FH spread spectrum, the available channel bandwidth W is subdivided into a large num- 
ber of nonoverlapping frequency slots. In any signaling interval, the transmitted signal 
occupies one or more of the available frequency slots. The selection of the frequency slot 
(s) in each signal interval is made pseudorandomly according to the output from a PN 
generator. 

A block diagram of the transmitter and receiver for an FH spread-spectrum sys- 
tem is shown in Figure 15.8. The modulation is either binary or M-ary FSK (MFSK). For 
example, if binary FSK is employed, the modulator selects one of two frequencies, such as 
fo or fi, corresponding to the transmission of a0 or a 1. The resulting binary FSK signal is 
translated in frequency by an amount determined by the output sequence from a PN gener- 
ator, which is used to select a frequency fe that is synthesized by the frequency synthesizer. 
This frequency is mixed with the output of the FSK modulator and the resultant signal is 
transmitted over the channel. For example, by taking m bits from the PN generator, we may 
specify 2” — 1 possible carrier frequencies. Figure 15.9 illustrates an FH signal pattern. 

At the receiver, there is an identical PN sequence generator, which is synchronized 
with the received signal and is used to control the output of the frequency synthesizer. 
Thus, the pseudorandom frequency translation introduced at the transmitter is removed at 
the demodulator by mixing the synthesizer output with the received signal. The resultant 
signal is then demodulated via an FSK demodulator. A signal for maintaining synchronism 
of the PN sequence generator with the FH received signal is usually extracted from the 
received signal. 

Binary PSK modulation generally yields better performance than binary FSK. How- 
ever, it is difficult to maintain phase coherence in the synthesis of the frequencies used in 
the hopping pattem and, also, in the propagation of the signal over the channel as the sig- 
nal is hopped from one frequency to another over a wide bandwidth. Consequently, FSK 
modulation with noncoherent demodulation is usually employed in FH spread-specwum 
systems. 

The frequency-hopping rate, denoted as R}, may be either equal to the symbol rate, 
lower than the symbol rate, or higher than the symbol rate. If R, is equal to or lower than 
the symbol rate, the FH system is called a slow hopping system. If R, is higher than the 
symbol rate, i.e., there are multiple hops per symbol, the FH system is called a fast hopping 
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Figure 15.8 Block diagram of an FH spread-spectrum system. 
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system. However, there is a penalty incurred in subdividing an information symbol into 
several frequency-hopped elements, because the energy from these separate elements is 
combined noncoherently. 

FH spread-spectrum signals may be used in CDMA where many users share a com- 
mon bandwidth. In some cases, an FH signal is preferred because of the stringent syn- 
chronization requirements inherent in DS spread-spectrum signals. Specifically, in a DS 
system, timing and synchronization must be established to within a fraction of a chip inter- 
val Te = 1/ W. On the other hand, in an FH system, the chip interval T is the time spent in 
transmitting a signal in a particular frequency slot of bandwidth B < W. But this interval 
is approximately 1/B, which is much larger than 1/ W. Hence, the timing requirements in 
an FH system are not as stringent as in a DS system. 

Next, we shall evaluate the performance of FH spread-spectrum systems under the 
condition that the system is slow hopping. 


15.5.1 Slow Frequency-Hopping Systems and Partial-Band Interference 


Consider a slow frequency-hopping system in which the hop rate R} = 1 hop per bit. We 
assume that the interference on the channel is broadband and is characterized as AWGN 
with power spectral density Jọ. Under these conditions, the probability of error for the 
detection of noncoherently demodulated binary FSK is 

1 


Po = Bos (15.5.1) 


where pp = p/o is the SNR per bit. 
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As in the case of a DS spread-spectrum system, we observe that €,, the energy 
per bit, can be expressed as p = PsT, = Ps/R, where Ps is the average transmitted 
power and R is the bit rate. Similarly, J) = P/W, where P; is the average power of the 
broadband interference and W is the available channel bandwidth. Therefore, the SNR pp 
can be expressed as 
_ b _ W/R 


san Ot (15.5.2) 
lo  Pi/Ps 


Pb 
where W/R is the processing gain and P,/Ps is the interference margin for the FH spread- 
spectrum signal. Note that the relationship in Equation (15.5.2) for the FH spread-spectrum 
signal is identical to that given by Equation (15.2.33) for the DS spread-spectrum sig- 
nal. Therefore, frequency hopping provides basically the same benefits as direct sequence 
spreading. 

Slow FH spread-spectrum systems are particularly vulnerable to partial-band inter- 
ference that may result in FH CDMA systems. To be specific, suppose that the partial-band 
interference is modeled as a zero-mean Gaussian random process with a flat power spec- 
tral density over a fraction of the total bandwidth W and zero in the remainder of the 
frequency band. In the region or regions where the power spectral density is nonzero, its 
value is S; (f) = Io/B, where O < £ < 1. In other words, the interference average power 
P; is assumed to be constant. 

Let us consider the worst case partial-band interference by selecting the value of 
B that maximizes the error probability. In an uncoded slow-hopping system with binary 
FSK modulation and noncoherent detection, the transmitted frequencies are selected with 
uniform probability in the frequency band W. Consequently, the received signal will be 
corrupted by interference with probability 8. When the interference is present, the proba- 
bility of error is 1/2 exp(—8p,/2) and when it is not, the detection of the signal is assumed 
to be error free. Therefore, the average probability of error is 


Py(B) = Ê eA? 


2 e 
ren pee 
=z exp( ER) . (15.5.3) 


Figure 15.10 illustrates the error rate as a function of p, for several values of 6. By 
differentiating P,(8), and solving for the value of 6 that maximizes P,(6), we find that 


Yon py>2 
Pat PRES (15.5.4) 
1, Pp <2 


The corresponding error probability for the worst case partial-band interference is 


(15.5.5) 


Z e'/pp, po =2 
b = 2 
te l2, Pb < 2 


which is also shown in Figure 15.10. Whereas the error probability decreases exponentially 
for full-band interference as given by Equation (15.5.1), the error probability for worst case 
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partial-band interference decreases only inversely with @,/Jo. This result is similar to the 
error probability for binary FSK in a Rayleigh fading channel. 

In our discussion of signal design for efficient and reliable communication over a 
fading channel in Chapter 14, we found that diversity, which can be obtained by simple 
repetition of the transmitted information bit on different frequencies (or by means of block 
or convolutional coding), provides a significant improvement in performance relative to 
uncoded signal transmission. It should not be surprising that the same type of signal coding 
is also effective on partial-band interference channels. In fact, it has been shown by Viterbi 
and Jacobs (1975) that by optimizing the code design for the partial-band interference, the 
communication system can achieve an average bit-error probability of 


P, = e”, (15.5.6) 


Therefore, the probability of error achieved with the optimum code design decreases expo- 
nentially with an increase in SNR and is within 3 dB of the performance obtained in an 
AWGN channel. Thus, the penalty due to partial-band interference is reduced significantly. 
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15.5.2 Fast Frequency Hopping 


In fast FH systems, the frequency-hop rate R, is some multiple of the symbol rate. Basi- 
cally, each (M-ary) symbol interval is subdivided into N subintervals, which are called 
chips and one of M frequencies is transmitted in each subinterval. Fast FH systems are 
particularly attractive for military communications. In such systems, the hop rate R;, may 
be selected sufficiently high so that a potential intentional interferer does not have suffi- 
cient time to detect the presence of the transmitted frequency and to synthesize a jamming 
signal that occupies the same bandwidth. 

To recover the information at the receiver, the received signal is first de-hopped by 
mixing it with the hopped carrier frequency. This operation removes the hopping pattem 
and brings the received signal in all subintervals (chips) to a common frequency band that 
encompasses the M possible transmitted frequencies. The signal in each subinterval is then 
passed through the M matched filters (or correlators) tuned to the M possible transmitted 
frequencies which are sampled at the end of each subinterval and passed to the detec- 
tor. The detection of the FSK signals is noncoherent. Hence, decisions are based on the 
magnitude of the matched filter (or correlator) outputs. 

Since each symbol is transmitted over N chips, the decoding may be performed 
either on the basis of hard decisions or soft decisions. The following example illustrates 
the decoding based on hard decisions. 


Example 15.5.1 


Suppose that binary FSK is used to transmit binary symbols, and each symbol is transmitted 
over N frequency hops, where N is odd. Determine the probability of error for an AWGN 
channel if hard-decision decoding is used. 


Solution The probability of error for noncoherent detection of binary FSK for each hop is 
1 





p= 5 e7Pbl2N (15.5.7) 
where g 
N 
> = = (15.5.8) 


is the SNR/chip and ©, is the total bit energy. The decoder decides in favor of the transmitted 
frequency that is larger in at least (N + 1)/2 chips. Thus, the decision is made on the basis 
of a majority vote given the decisions on the N chips. Consequently, the probability of a bit 


error is 
N 


N 
P, = D (*) p™(1— pY", (15.5.9) 


m=(N-+1)/2 


where p is given by Equation (15.5.7). We should note that the error probability P, for hard- 
decision decoding of the N chips will be higher than the error probability for a single hop/bit 
FSK system, which is given by Equation (15.5.1), when the SNR/bit p, is the same in the two 
systems (see Problem 15.17). E 


The altemative to hard-decision decoding is soft-decision decoding in which the 
magnitudes (or magnitudes squared) of the corresponding matched filter outputs are 
summed over the N chips and a single decision is made based on the frequency giving the 
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largest output. For example, if binary orthogonal FSK is used to transmit the information, 
the two soft-decision metrics for the N chips based on square-law combining are 
N 2 
Sp 
DM, = | — +v ? 
1 B N + Vik 





a (15.5.10) 
DM: = D vel", 
k=l 


where {v1,} and {vx} are the noise components from the two matched filters for the 
N chips. Since frequency fı is assumed to have been transmitted, a decision error occurs 
when DM, > DM. The probability of this event error for additive Gaussian noise may be 
obtained in closed form, although its derivation is cumbersome. The final result is 





1 po \i 
P= a eo? YK (F) , (15.5.11) 
i=0 


where the set {K;} are constants, which may be expressed as 


N-1-i 
1 2N — 1 
K; = 7 J ( z ) ; (15.5.12) 


r=0 


Theerror probability for soft-decision decoding given by Equation (15.5.11) is lower 
than that for hard-decision decoding given by Equation (15.5.9) for the same €,/No. The 
difference in performance is the loss in hard-decision decoding. 

If soft-decision decoding is used in the presence of partial-band interference, it is 
important to scale (or normalize) the matched filter outputs in each hop, so that a strong 
interference that falls within the transmitted signal band in any hop does not dominate the 
output of the combiner. A good strategy in such a case is to normalize, or clip, the matched 
filter outputs from each hop if their values exceed some threshold that is set near (slightly 
above) the mean of the signal-plus-noise power level. Altematively, we may monitor the 
noise power level and scale the matched filter outputs for each hop by the reciprocal of the 
noise power level. Thus, the noise power levels from the matched filter outputs are normal- 
ized. Therefore, with proper scaling, a fast FH spread-spectrum system will not be as vul- 
nerable to partial-band interference because the transmitted information/bit is distributed 
(or spread) over N frequency hops. 


15.5.3 Applications of FH Spread Spectrum 


FH spread spectrum is a viable alternative to DS spread spectrum for protection against 
narrowband and broadband interference that is encountered in CDMA. In CDMA systems 
based on frequency hopping each transmitter—receiver pair is assigned its own pseudoran- 
dom frequency-hopping pattern. Aside from this distinguishing feature, the transmitters 
and receivers of all users may be identical; i.e., they have identical encoders, decoders, 
modulators, and demodulators. 
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CDMA systems based on FH spread-spectrum signals are particularly attractive for 
mobile (land, air, sea) users because timing (synchronization) requirements are not as strin- 
gent as in a DS spread-spectrum system. In addition, frequency-synthesis techniques and 
associated hardware have been developed that make it possible to frequency-hop over 
bandwidths that are significantly larger, by one or more orders of magnitude, than those 
currently possible with DS spread-spectrum signals. Consequently, larger processing gains 
are possible by FH, which more than offset the loss in performance inherent in noncoherent 
detection of the FSK-type signals. 

FH is also effective against intentional interference. As we have described above, 
an FH M-ary (M > 2) FSK system that employs coding, or simply repeats the informa- 
tion symbol on multiple hops (repetition coding), is very effective against a partial-band 
interference. As a consequence, the interferer’s threat is reduced to that of an equivalent 
broadband noise interference whose transmitter power is spread across the channel band- 
width W. 


15.6 SYNCHRONIZATION OF SPREAD-SPECTRUM SYSTEMS 


Time synchronization of the receiver to the received spread-spectrum signal may be sep- 
arated into two distinct phases. There is an initial acquisition phase, during which time 
the receiver establishes time synchronization by detecting the presence of a special initial 
acquisition sequence. The initial acquisition phase is followed by the transmission of data, 
during which period the receiver must track the signal timing. 


15.6.1 Acquisition Phase 


In a DS spread-spectrum system, the PN code sequence must be synchronized in time to 
within a small fraction of the chip interval Te = 1/ W. The problem of initial synchroniza- 
tion may be viewed as one in which we attempt to synchronize the receiver clock to the 
transmitter clock. Usually, extremely accurate and stable time clocks are used in spread- 
spectrum systems in order to reduce the time uncertainty between the receiver clock and 
the transmitter clock. Nevertheless, there is always an initial timing uncertainty that is due 
to propagation delay in the transmission of the signal through the channel. This is espe- 
cially a problem when communication is taking place between two mobile users. In any 
case, the usual procedure for establishing initial synchronization is for the transmitter to 
send a known pseudorandom sequence to the receiver. The receiver is continuously in a 
search mode looking for this sequence in order to establish initial synchronization. 

Suppose that the initial timing uncertainty is T, seconds and the chip duration is Te. 
Since initial synchronization takes place in the presence of additive noise and, perhaps 
other interference, it is necessary to dwell for Tz = NT; sec in order to test synchronism 
at each time instant, where N is some positive integer. If we search over the time uncer- 
tainty interval in (coarse) time steps of T,/2, then the time required to establish initial 
synchronization is 


u 


— Tz = 2NT,. 
T./2 "4 2NT, 


Tinit sync = 
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Clearly, the synchronization sequence transmitted to the receiver must be at least as long 
as 2 NT, seconds in order for the receiver to have sufficient time to perform the necessary 
search in a serial fashion. 

In principle, matched filtering or cross correlation are optimum methods for estab- 
lishing initial synchronization in the presence of additive Gaussian noise. A filter matched 
to the known data waveform generated from the known pseudorandom sequence continu- 
ously compares its output with a predetermined threshold. When the threshold is exceeded, 
initial synchronization is established and the demodulator enters the “data receive” mode. 

Alternatively, we may implement a sliding correlator as shown in Figure 15.11. The 
correlator cycles through the time uncertainty, usually in discrete-time intervals of T,/2 
seconds or less. The cross correlation is performed over the time interval NT,, where N is 
the number of chips in the synchronization sequence, and the correlator output is compared 
with a threshold to determine if the known signal sequence is present. If the threshold is not 
exceeded, the known reference sequence is advanced by T,/2 seconds and the correlation 
process is repeated. These operations are performed until a signal is detected or until the 
search has been performed over the time uncertainty interval T,,. In the case of the latter 
outcome, the search process is repeated. 

A similar procedure may be used for FH signals. In this case, the problem is to 
synchronize the PN code sequence generated at the receiver that controls the hopped- 
frequency pattern. To accomplish this initial synchronization, a known frequency-hopped 
signal is transmitted to the receiver. The initial acquisition system at the receiver looks for 
this known FH signal pattern. For example, a band of matched filters tuned to the trans- 
mitted frequencies in the known pattern may be employed. Their outputs must be properly 
delayed, envelope or square-law detected, weighted, if necessary, and added to produce 
the signal output which is compared with a threshold. A signal present (signal acquisition) 
is declared when the threshold is exceeded. The search process is usually performed con- 
tinuously in time until a threshold is exceeded. A block diagram illustrating their signal 
acquisition scheme is given in Figure 15.12. As an alternative, a single matched filter and 
envelope detector may be used preceded by a frequency-hopping pattern generator and fol- 
lowed by a threshold detector. This configuration, which is shown in Figure 15.13, is based 
on a serial search and is akin to the sliding correlator for DS spread-spectrum signals. 

The sliding correlator for DS signals and its counterpart shown in Figure 15.13 for 
FH signals basically perform a serial search that is generally time consuming. As an 
alternative, one may employ some degree of parallelism by having two or more such 
correlators operating in parallel and searching over nonoverlapping time slots. In such 
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Figure 15.12 System for acquisition of an FH signal. 


a case, the search time is reduced at the expense of a more complex and costly 
implementation. 

During the search mode, there may be false alarms that occur occasionally due to 
additive noise and other interference. To handle the occasional false alarms, it is necessary 
to have an additional method or circuit that checks to confirm that the received signal at 
the output of the correlator remains above the threshold. With such a detection strategy, 
a large noise pulse that causes the matched filter output to exceed the threshold will have 
only a transient effect on synchronization, since the matched filter output will fall below 
the threshold once the large noise pulse passes through the filter. On the other hand, when 
a signal is present, the correlator or matched filter output will remain above the threshold 
for the duration of the transmitted signal. Thus, if confirmation fails, the search for signal 
synchronization is resumed. 

In the above discussion, we considered only time uncertainty in establishing initial 
synchronization. However, another aspect of initial synchronization is frequency uncer- 
tainty. If the transmitter and, or, the receiver are mobile, the relative velocity between them 
results in a Doppler frequency shift in the received signal relative to the transmitted signal. 
Since the receiver does not know the relative velocity, a priori, the Doppler frequency shift 
is unknown and must be determined by means of a frequency-search method. Such a search 
is usually accomplished in parallel over a suitably quantized frequency uncertainty interval 
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and serially over the time uncertainty interval. A block diagram of this scheme for DS 
spread-spectrum signals is shown in Figure 15.14. Appropriate Doppler frequency-search 
methods can also be devised for FH signals. 


15.6.2 Tracking 


Oncethe signal is acquired, the initial synchronization process is stopped and fine synchro- 
nization and tracking begins. The tracking maintains the PN code generator at the receiver 
in synchronism with the received signal. Tracking includes fine-chip synchronization. 

For a DS spread-spectrum signal, tracking is usually performed by means of a track- 
ing loop, called a delay-locked loop (DLL), as shown in Figure 15.15. In this tracking loop, 
the received signal is applied to two multipliers, where it is multiplied by two outputs from 
the local PN code generator which are delayed relative to each other by an amount of 
26 < Te. Thus, the product signals are the cross correlations between the received signal 
and the PN sequence at the two values of delay. These products are bandpass filtered, enve- 
lope (or square-law) detected, and then subtracted. This difference signal is applied to the 
loop filter that drives the voltage-controlled clock (VCC). The VCC output serves as the 
clock for the PN code signal generator. 

If the synchronism is not exact, the filtered output from one correlator will exceed 
the other and the VCC will be appropriately advanced or delayed. At the equilibrium point, 
the two filtered correlator outputs will be equally displaced from the peak value, and the 
PN code generator output will be exactly synchronized to the received signal which is fed 
to the demodulator. We observe that this implementation of the DLL for tracking the DS 
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Figure 15.14 Initial search for the Doppler frequency offset in a DS system. 
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signal is equivalent to the early—late gate bit-tracking synchronizer previously described 
in Chapter 8. 

An alternative method for time tracking a DS signal is touse a tau-dither loop (TDL), 
which is illustrated by the block diagram in Figure 15.16. The TDL employs only a single 
“arm” instead of the two “arms” shown in Figure 15.15. By providing a suitable gating 
waveform, it is possible to make this single “arm” implementation appear to be equivalent 
to the two “arm” realization. In this case, the cross correlator output is regularly sampled 
at two values of delay, by stepping the code clock forward and backward in time by an 
amount ô. The envelope of the cross correlation that is sampled at +ô has an amplitude 
modulation whose phase relative to the tau-dither modulator determines the sign of the 
tracking error. 

One advantage of the TDL is the less costly implementation resulting from elimina- 
tion of one of the two arms that are employed in the conventional DLL. A second and less 
apparent advantage is that the TDL does not suffer from performance degradation that is 
inherent in the DLL when the amplitude gain in the two arms is not properly balanced. 

Both the DLL and the TDL generate an error signal by sampling the signal correla- 
tion function at +ô off the peak, as shown in Figure 15.17(a). This generates an error signal 
as shown in Figure 15.17(b). The analysis of the performance of the DLL is similar to that 
for the PLL, previously described in Chapter 8. If it were not for the envelope detectors in 
the two arms of the DLL, the loop resembles a Costas loop. In general, the variance of the 
time-estimation error in the DLL is inversely proportional to the loop SNR, which depends 
on the input SNR to the loop and on the loop bandwidth. Its performance is somewhat 
degraded as in the squaring PLL by the nonlinearities inherent in the envelope detectors, 
but this degradation is relatively small. 

A tracking method for FH spread-spectrum signals is illustrated in Figure 15.18. 
This method is based on the premise that, although initial acquisition has been achieved, 
there is a small timing error between the received signal and the received clock. The band- 
pass filter is tuned to a single intermediate frequency and its bandwidth is of the order of 
1/T-, where Te is the chip interval. Its output is envelope detected and then multiplied by 
the clock signal to produce a three-level signal, as shown in Figure 15.19, which drives 
the loop filter. Note that when the chip transitions from the locally generated sinusoidal 
waveform do not occur at the same time as the transitions in the incoming received signal, 
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Figure 15.19 Waveforms for the tracking method of FH signals shown in Figure 15.18. 


the output of the loop filter will be either negative or positive, depending on whether the 
VCC is lagging or advanced ‘relative to the timing of the input signal. This error signal 
from the loop filter will provide the control signal for adjusting the VCC timing signal so 
as to drive the frequency synthesized FH signal to proper synchronism with the received 
signal. 


15.7 DIGITAL CELLULAR COMMUNICATION SYSTEMS 


The demand to provide telephone service for people traveling in automobiles, buses, trains, 
and airplanes has been steadily increasing over the past three to four decades. To meet 
this demand, radio-transmission systems have been developed to link the mobile-telephone 
user to the terrestrial-telephone network. Today, radio-based systems make it possible for 
people to communicate via telephone while traveling on airplanes and motor vehicles. In 
this section, we will briefly describe the cellular telephone system that provides telephone 
service to people with handheld mobile telephones. 

A major problem with the establishment of any radio communication system is the 
availability of a portion of the radio spectrum. In the case of radio telephone service, the 
Federal Communications Commission (FCC) in the United States has assigned parts of 
the UHF band in the range 806-890 MHz and in the 1900 MHz band for this use. Similar 
frequency assignments in the UHF band have been made in Europe and Japan. 
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Figure 15.20 Mobile-radio base station. 


The cellular-radio concept was adopted as a method for the efficient utilization of the 
available frequency spectrum, especially in highly populated metropolitan areas, where the 
demand for mobile telephone services is the greatest. A geographic area is subdivided into 
cells, each of which contains a base station, as illustrated in Figure 15.20. Each base station 
is connected via telephone lines to a mobile-telephone-switching office (MTSO) which, in 
turn, is connected via telephone lines to a telephone central office (CO) of the terrestrial 
telephone network. 

A mobile user communicates via radio with the base station within the cell. The base 
station routes the call through the MTSO to another base station (if the called party is 
located in another cell) or to the central office of the terrestrial-telephone network (if the 
called party is not a mobile). Each mobile telephone is identified by its telephone number 
and the telephone serial number assigned by the manufacturer. These numbers are auto- 
matically transmitted to the MTSO during the initialization of the call for authentication 
and billing purposes. 

A mobile user initiates a telephone call in the usual manner by keying in the desired 
telephone number and pressing the “send” button. The MTSO checks the authentication of 
the mobile user and assigns an available frequency channel for the radio transmission of 
the voice signal from the mobile to the base station. The frequency assignment is sent to the 
mobile telephone via a supervisory control channel. A second frequency is assigned for the 
radio transmission from the base station to the mobile user. The simultaneous transmission 
between the two parties is called full-duplex operation. The MTSO interfaces with the cen- 
tral office of the telephone network to complete the connection to the called party. All tele- 
phone communications between the MTSO and the telephone network are via wideband 
trunk lines that carry speech signals from many users. Upon completion of the telephone 
call, the two parties hang up and the radio channel becomes available for other users. 

During the phone call, the MTSO monitors the signal strength of the radio tansmis- 
sion from the mobile user to the base station. If the signal strength drops below a preset 
threshold, the MTSO views this as an indication that the mobile user is moving out of the 
initial cell into a neighboring cell. By communicating with the base stations of neighboring 
cells, the MTSO finds a neighboring cell that receives a stronger signal and automatically 
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switches or hands-off the mobile user to the base station of the adjacent cell. The switching 
is performed in a fraction of a second and is generally transparent to the two parties. 

The cellular radio telephone system is designed such that the transmitter powers of 
the base station and the mobile users are sufficiently small, so that signals do not propa- 
gate beyond immediately adjacent cells. This allows frequencies to be reused in other cells 
outside of the adjacent cells. Consequently, by making the cells smaller and reducing the 
radiated power, it is possible to increase frequency reuse and, thus, to increase the band- 
width efficiency and the number of mobile users. Current cellular systems employ cells 
with a radius in the range of 5-18 km. 

Below, we present an overview of two types of current digital cellular communica- 
tion systems. One is the GSM (Global System for Mobile Communication) system that 
is widely used in Europe and other parts of the world. The second is the CDMA system, 
which is widely used in North America and some countries in the Far East. 

There are three basic methods that are currently used to provide channel access to 
multiple users in a communication network. One simple method is to subdivide the avail- 
able channel bandwidth into a number, say, K, of frequency nonoverlapping subchannels 
and to assign a subchannel to each user upon request. This method is called frequency- 
division multiple access (FDMA). It is commonly used in wireline channels to accommo- 
date multiple users for voice and data transmission. FDMA is also used in the first gen- 
eration of cellular communication systems in which analog FM is employed, as described 
above. 

A second method for providing access to multiple users is to subdivide a time inter- 
val, called a frame, into K nonoverlapping subintervals, each of duration T;/K, where 
T; is the frame duration. Then, each user is assigned to a particular subinterval, or time 
slot, within a frame. This multiple access method is called time-division multiple access 
(TDMA), and it is used in the GSM system described in Section 15.7.1. 

The third method for providing multiple access is CDMA, which was described in 
Section 15.3.2. In CDMA, each user is assigned a unique spreading sequence, such as, a 
Gold sequence or a Kasami sequence. All users are allowed to transmit simultaneously over 
the same channel. In FDMA and TDMA, the signals of the multiple users are nonoverlap- 
ping in either frequency or time; however, in CDMA, the transmitted signals completely 
overlap in both time and frequency. By assigning unique code sequences to multiple users, 
each user is able to separate its desired signal (via cross correlation) from the other user sig- 
nals, which appear as interference. The digital cellular system based on the IS-95 standard 
employs CDMA. This system is described in Section 15.7.2. 


15.7.1 The GSM System 


GSM was developed in Europe to provide a common digital cellular communication sys- 
tem that would serve all of Europe. It is now widely used in many parts of the world. 
The GSM system employs the frequency band 890-915 MHz for signal transmission from 
mobile transmitters to base station receivers-(uplink or reverse link) and the frequency band 
935-960 MHz for transmission from the base stations to the mobile receivers (downlink or 
forward link). The two 25-MHz frequency bands are each subdivided into 125 channels, 
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Figure 15.21 Frequency reuse in a cellular system with a reuse factor N = 4 or N = 7. 


where each channel has a bandwidth of 200 kHz. There are two methods to reduce wans- 
mission interference from adjacent cells. First, different sets of frequencies are assigned 
to adjacent base stations. Second, frequencies are reused according to some design plan, 
such as the frequency plans shown in Figure 15.21, where the frequency reuse factor is 
either N = 4 or N = 7. Larger values of N increase the distance d, between two base 
stations using the same set of frequencies, thus, reducing co-channel interference. On the 
other hand, a large value of N reduces the spectral efficiency of the cellular system, since 
fewer frequencies are assigned to each cell. The maximum radius for a cell is 35 km. 

Each 200-kHz frequency band accommodates eight users by creating eight TDMA 
nonoverlapping time slots, as shown in Figure 15.22. The eight time slots constitute a frame 
of duration 4.615 msec, and each time slot has a time duration of 576.875 usec. The infor- 
mation data from the users is transmitted in bursts at a rate of 270.833 kbps. Figure 15.23 
illustrates the basic GSM frame structure, where 26 frames form a multiframe and 51 multi- 
frames form a superframe. The framing hierarchy facilitates synchronization and network 
conwol. To reduce the effect of fading and interference and provide signal diversity, the 
carrier frequency is hopped at the frame rate of (nominally) 217 hops/sec. 

The functional block diagram of the transmitter and receiver in the GSM system is 
shown in Figure 15.24. The speech coder is based on a type of linear predictive coding 
(LPC) called residual pulse-excited (RPE) linear predictive coding (RPE-LPC). RPE-LPC 
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Figure 15.22 TDMA frame in GSM. 
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Figure 15.23 GSM frame structure. 


delivers 260 bits in each 20-msec time interval—hence, a bit rate of 13 kbps. The most 
significant bits are encoded by a rate 1/2, constraint length L = 5 convolutional encoder, 
and the coded and uncoded bits are block interleaved to produce data at a rate of 22.8 kbps. 
Thus, the 260 information bits are transformed into 456 coded bits in each 20-msec time 
interval. The coded bit stream is encrypted. Then, it is organized for burst transmission 
in time slots that carry 114 coded bits and some overhead bits, as shown in Figure 15.23, 
including a sequence of 26 bits that measure the channel characteristics in each time slot. 
Therefore, the 456 coded bits are transmitted in four consecutive bursts, where each burst 
contains 114 coded bits and occupies one time slot. 

To transmit the bits in each time slot, we use GMSK modulation with BT = 0.3. 
This signal pulse is illustrated in Figure 9.25(e). The output of the GMSK modulator is 
translated to the desired carrier frequency, which is hopped to a different frequency in each 
frame. 

At the receiver, the received signal is dehopped and translated to baseband; this cre- 
ates in-phase (7) and quadrature (Q) signal components, which are sampled and buffered. 
The 26 known transmitted bits measure the channel characteristics; thus, they specify the 
matched filter to the channel corrupted signal. The data bits are passed through the matched 
filter and the matched filter output is processed by a channel equalizer, which may be real- 
ized either as a decision-feedback equalizer (DFE) or a maximum-likelihood sequence 
detector that is efficiently implemented via the Viterbi algorithm. The bits in a burst at the 
output of the equalizer are deassembled, deencrypted, deinterleaved and passed to the chan- 
nel decoder. The decoded bits are used to synthesize the speech signal that was encoded 
via RPE-LPC. 
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TABLE 15.3 SUMMARY OF PARAMETERS IN A GSM SYSTEM 








System Parameter Specification 
Uplink frequency band 890-915 MHz 
Downlink frequency band 935-960 MHz 

Number of carriers/band 125 

Bandwidth/carrier 200 kHz 

Multiple access method TDMA 

Number of users/carrier 8 

Data rate/carrier 270.8 kbps 

Speech coding rate 13 kHz 

Speech encoder RPE-LPC 

Coded speech rate 22.8 kbps 

Modulation GMSK with BT = 0.30 
Demodulation Matched filter + equalizer 
Interleaver Block 


Frequency hopping rate 


217 hops/sec 


Chapter 15 


In addition to the channels that transmit the digitized speech signals, there are other 
channels that handle various control and synchronization functions, such as paging, fre- 
quency correction, synchronization, and requests for access to send messages. These con- 
trol channels are additional time slots that contain known sequences of bits for performing 
the control functions. 

Table 15.3 provides a summary of the basic parameters in the GSM system. 


15.7.2 CDMA System Based on IS-95 

As described in Section 15.2, the enhancement in performance obtained from a DS spread- 
spectrum signal through the processing gain and coding gain can enable many DS spread- 
spectrum signals to simultaneously occupy the same channel bandwidth provided that 
each signal has its own distinct pseudorandom sequence. Direct sequence CDMA has 
been adopted as one multiple-access method for digital cellular voice communications 
in North America. This first generation digital cellular (CDMA) communication system 
was developed by Qualcomm, and it has been standardized and designated as IS-95 by 
the Telecommunications Industry Association (TIA) for use in the 800 MHz and the 1900 
MHz frequency bands. A major advantage of CDMA over other multiple access methods 
is that the entire frequency band is available at each base station, i.e., the frequency reuse 
factor N = 1. 

The nominal bandwidth used for transmission from a base station to the mobile 
receivers (forward link) is 1.25 MHz. A separate channel, also with a bandwidth of 1.25 
MHz, is used for signal tansmission from mobile receivers to a base station (reverse link). 
The signals transmitted in both the forward and the reverse links are DS spread spectrum 
signal and they have a chip rate of 1.2288 x 10° chips per second (1.2288 Mchips/sec). 


Forward Link. A block diagram of the modulator for the signals transmitted 
from a base station to the mobile receivers is shown in Figure 15.25. The speech coder is 
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Figure 15.25 Block diagram of a IS-95 forward link. 


a code-excited linear predictive (CELP) coder that generates data at the variable rates of 
9600, 4800, 2400, and 1200 bits/sec, where the data rate is a function of the user’s speech 
activity in frame intervals of 20 msec. The data from the speech coder is encoded by a rate 
1/2, constraint length L = 9 convolutional code. For lower speech activity, where the data 
rates are 4800, 2400, or 1200 bits/sec, the output symbols from the convolutional encoder 
are repeated either twice, four times, or eight times to maintain a constant bit rate of 9600 
bits/sec. At the lower speech activity rates, the transmitter power is reduced by either 3, 6, 
or 9 dB, so that the transmitted energy per bit remains constant for all speech rates. Thus, 
a lower speech activity results in a lower transmitter power and, hence, a lower level of 
interference to other users. 

The encoded bits for each frame are passed through a block interleaver, which over- 
comes the effects of burst errors that may occur in transmission through the channel. The 
data bits at the output of the block interleaver, which occur at a rate of 19.2 kbits/sec, are 
scrambled by multiplication with the output of a long code (period N = 24? — 1) generator 
running at the chip rate of 1.2288 Mchips/sec, but whose output is decimated by a factor 
of 64 to 19.2 kchips/sec. The long code uniquely identifies a call of a mobile station on the 
forward and reverse links. 

Each channel user is assigned a Hadamard (also called a Walsh) sequence of length 
64. There are 64 orthogonal Hadamard sequences assigned to each base station; thus, there 
are 64 channels available. One Hadamard sequence (the all-zero sequence) is used to wans- 
mit a pilot signal, which serves as a means for measuring the channel characteristics, 
including the signal strength and the carrier phase offset. These parameters are used at 
the receiver to perform phase coherent demodulation. Another Hadamard sequence is used 
to provide time synchronization. One channel, and possibly more if necessary, is used for 
paging. That leaves up to 61 channels for allocation to different users. 
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Each user, using the assigned Hadamard sequence, multiplies the data sequence 
by the assigned Hadamard sequence. Thus, each encoded data bit is multiplied by the 
Hadamard sequence of length 64. The resulting binary sequence is now spread by multipli- 
cation with two PN sequences of length N = 2!5; this creates in-phase (7) and quadrature 
(Q) signal components. Thus, the binary data signal is converted to a four-phase signal 
and both the J and Q components are filtered by baseband spectral-shaping filters. Differ- 
ent base stations are identified by different offsets of these PN sequences. The signals for 
all 64 channels are transmitted synchronously so that, in the absence of channel multipath 
distortion, other signals received at any mobile receiver do not interfere because of the 
orthogonality of the Hadamard sequences. 

At the receiver, a RAKE demodulator resolves the major multipath signal compo- 
nents. These components are then phase-aligned and weighted according to their signal 
strength, using the estimates of phase and signal strength derived from the pilot signal. 
These components are combined and passed to the Viterbi soft-decision decoder. 


Reverse link. The reverse link modulator from a mobile transmitter to a base 
station is different from the forward link modulator. A block diagram of the modulator 
is shown in Figure 15.26. An important consideration in the design of the modulator is 
that signals transmitted from the various mobile transmitters to the base station are asyn- 
chronous; hence, there is significantly more interference among users. In addition, the 
mobile transmitters are usually battery operated; consequently, these transmissions are 
power limited. To compensate for these major limitations, a rate 1/3, K = 9 convolu- 
tional code is used in the reverse link. This code has essentially the same coding gain in 
an AWGN channel as the rate 1/2 code used in the forward link. However, it has a much 
higher coding gain in a fading channel. Again, for lower speech activity, output bits from 
the convolutional encoder are repeated either two, four, or eight times. However, the coded 
bit rate is 28.8 kbits/sec. g 

For each 20-msec frame, the 576 encoded bits are block-interleaved and passed to the 
modulator. The data are modulated using an M = 64 orthogonal signal set of Hadamard 
sequences each of length 64. Thus, a 6-bit block of data is mapped into one of the 64 
Hadamard sequences. The result is a bit (or chip) rate of 307 .2 kbits/sec at the output of the 
modulator. Note that 64-ary orthogonal modulation at an error probability of 1076 requires 
approximately 3.5 dB less SNR per bit than binary antipodal signaling in an AWGN 
channel. 

To reduce interference to other users, the time position of the transmitted code sym- 
bol repetitions is randomized; thus, at the lower speech activity, consecutive bursts are not 
evenly spaced in time. Following the randomizer, the signal is spread by the output of the 
long code PN generator, which is running at a rate of 1.2288 Mchips/sec. Hence, there are 
only four PN chips for every bit of the Hadamard sequence from the modulator, so the 
processing gain in the reverse link is very-small. The resulting 1.2288-Mchips/sec binary 
sequences of length N = 215, whose rate is also 1.2288 Mchips/sec, create I and Q sig- 
nals (a QPSK signal) that are filtered by baseband spectral shaping filters and then passed 
to quadrature mixers. The Q-channel signal is delayed in time by one-half PN chip relative 
to the 7-channel signal prior to the baseband filter. In effect, the signal at the output of the 
two baseband filters is an offset QPSK signal. 
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TABLE 15.4 SUMMARY OF PARAMETERS IN THE IS-95 SYSTEM 





System Parameter 


Uplink frequency band 
Downlink frequency band 
Number of carriers/band 
Bandwidth/carrier 
Multiple access method 
Number of users/carrier 
Chip rate 

Speech coder 

Speech rate 

Interleaver 

Channel encoder 


Modulation 


Demodulation 


Signature sequences 
PN sequence 


Specification 


824-849 MHz 

869-894 MHz 

20 

1.25 MHz 

CDMA 

60 

1.2288 Mcps 

Variable rate CELP 

9600, 4800, 2400, 1200 bps 
Block 

R=1/2, L =9 (D) 

R =1/3, L =9 (U) : 
BPSK with QPSK spreading (D) 
64-ary orthogonal with QPSK 
spreading (U) 

RAKE matched filter with 
maximal-ratio combining 
Hadamard (Walsh) of length 64 
N = 2” — 1 (Long code) 


N = 2" (spreading codes) 





Although the chips are transmitted as an offset QPSK signal, the demodulator 
employs noncoherent demodulation of the M = 64 orthogonal Hadamard waveforms 
to recover the encoded data bits. A computationally efficient (fast) Hadamard transform 
reduces the computational complexity in the demodulation process. The output of the 
demodulator is then fed to the Viterbi decoder, whose output synthesizes the speech signal. 

Table 15.4 provides a summary of the basic parameters in the IS-95 system. 


15.7.3 Third Generation Cellular Communication Systems and Beyond 


The first generation cellular radio systems employed analog signal transmission. The GSM 
and IS-95 digital cellular systems described above are usually called second generation 
cellular radio systems. They are designed primarily for voice transmission using digital 
modulation for transmission. These systems have the capability of providing low-rate data 
transmission, i.e., 64 kbps. 

To accommodate the need for multimedia applications, such as internet access, image 
transmission, and video conferencing, higher data tate cellular systems have been devel- 
oped. These new systems are third generation cellular radio systems and have the potential 
to provide data rates of 5 Mbps. 

The evolution of IS-95 is called CDMA-2000. The channel bandwidth is increased to 
3.75 MHz, rather than the 1.25 MHz bandwidth used in IS-95. The corresponding chip rate 
is 3.684 Mcps. QPSK modulation is used in the downlink to achieve a peak data rate of up 
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to 2.4 Mbps. BPSK is used in the uplink. Higher data rates can be achieved by transmitting 
on multiple channels. 

The evolution of GSM is called UMTS or wideband COMA (WCDMA). A nominal 
channel bandwidth of 5 MHz supports a chip rate of 3.84 Mcps. The modulation for both 
uplink and downlink is BPSK. WCDMA offers symmetrical data rates of 384 kbps at a 
mobile terminal speed of 120 km/hour, 128 kbps at a mobile terminal speed of up to 300 
km/hour, and a data rate of 2 Mbps at pedestrian speed. 

Third generation cellular systems are designed differently from the older generation 
systems which were optimized for voice transmission. The basic difference in providing 
data services is the requirement for very high data rates in the downlink (forward link) 
from the base station to the mobile users. This requirement has resulted in a basic change 
by which the base station transmits to the mobile receivers. Specifically, the downlink 
operates in a time-shared manner (TDMA method) in transmitting to the mobile receivers. 
Thus, the downlink serves one user at a time and operates in a time-multiplexing manner. 

In order to optimize the system throughput, the signal-to-noise ratio (SNR) level of 
each mobile receiver is used by the base station to determine the data rate for each user in 
the downlink. The SNR level is measured in each user receiver and sent back to the base 
station via the uplink (reverse link). Thus, on the basis of the SNR value, the base station 
selects the data rate to be transmitted to the user in a particular time slot. The key element 
employed at the base station to determine the manner in which the users are serviced is the 
scheduler, which is programmed to operate in a way that ensures some degree of fairness 
and latency (service delay). For example, a round robin method of service guarantees fair- 
ness, but does not yield a high throughput because a user with a low SNR is served at a low 
data rate and, consequently, must be assigned more time slots. In contrast, throughput is 
optimized by allocating more time slots to users having a high SNR, since the base station 
can employ high level M-ary modulation to provide a high data rate. But such a method 
may result in long service delays (large latency) to users with low SNR. In practice, the 
scheduler employs an algorithm that provides some degree of fairness and simultaneously 
constrains the latency to an acceptable level. Thus, the design of the scheduling algorithm 
involves a compromise among throughput, fairness and latency. To further increase the 
data rate to the users in the downlink, the base station may use multiple transmit antennas 
to achieve spatial multiplexing, as described in Section 14.4. 

With the increase use of mobile devices, such as smart phones, tablet computers and 
laptop computers, even third generation cellular systems do not provide sufficient capacity 
and data speed to satisfy the demand for new services. This increase in demand has resulted 
in a new world-wide fourth generation (4G) standard, called long-term evolution (LTE). 
Some of the major capabilities and characteristics of 4G-LTE are 


1. Spectrum flexibility with possible bandwidths of 1.4 MHz, 5 MHz, 10 MHz, 15 
MHz, and 20 MHz. 

2. Peak data rates of up to 300 Mbits/sec on the downlink and 75 Mbits/sec on the 
uplink when using a 20 MHz bandwidth and 4 x 4 (MIMO) antennas. 

3. Also supports peak datarates of up to 100 Mbits/sec on the downlink and 50 Mbits/sec 
on the uplink when using a 20 MHz channel bandwidth with a single antenna at the 
mobile and two receive antennas at the base station. 
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4. Supports mobility for terminals moving up to 350 km/hour or 500 km/hour, depend- 
ing on the frequency band. 


5. OFDM is used for the downlink and a modified form of OFDM is used for the uplink. 


4G-LTE is being provided by all major cellular service providers around the world. 


15.8 SUMMARY AND FURTHER READING 


In this chapter we inwoduced the reader to spread spectrum digital communication tech- 
niques and their use in combating interference, both intentional, usually called jamming, 
and unintentional, the latter arising from other users of the same channel. In particular, 
we treated direct sequence (DS) spread spectrum and frequency-hopped (FH) spread spec- 
trum, and their performance characteristics in the presence of interference. We showed 
that in DS spread spectrum, the transmitted signal is spread in frequency by multiplying 
the information-bearing signal by a pseudo-noise (PN) signal that has the characteristics of 
white noise. At the receiver, the wideband received signal is despread by multiplying the 
received signal by a replica of the PN sequence. DS spreading was shown to be especially 
effective in combating narrowband interference. The probability of error for the detector 
was evaluated in the presence of narrowband interference. We also observed the benefits 
of coding in a DS spread spectrum system. 

Wecited four applications of DS spread spectrum in digital communications. Specif- 
ically, we described the use of DS spread spectrum signals in low detectability signal trans- 
mission, in code division multiple access (CDMA) systems in which multiple users share 
the same common channel, in communication over multipath channels to resolve the signal 
multipath components, and in wireless LANs, such as those based on the 802.11 (WiFi) 
standard. We also described the generation of PN sequences for use in spreading the spec- 
trum of the transmitted signal. 

In FH spread-spectrum systems, a PN sequence is used to pseudorandomly select the 
carrier frequency within the large bandwidth of the channel. We described slow frequency 
hopping systems, where the carrier frequency is changed at the symbol rate or slower, and 
fast frequency hopping systems, where the carrier frequency is changed multiple times 
within the symbol interval. We observed that slow frequency hopping systems are vulnera- 
ble to partial-band interference, but we also noted that by providing signal diversity in the 
design of the transmitted signal, this type of interference can be mitigated. Applications of 
FH spread spectrum signals were also cited. 

Synchronization of the locally generated PN sequence at the receiver with the PN 
sequence in the received signal is necessary in the demodulation of the received signal and 
in the recovery of the transmitted information. We described time synchronization that is 
performed in two phases, the initial acquisition phase and the tracking phase. 

In the final topic treated in this chapter, we described the use of spread spectrum in 
digital cellular communication systems, including 2nd, 3rd and 4th generation (2G, 3G, 
4G) cellular systems. 

Historically, the primary application of spread spectrum signals had been in the 
design of secure digital communication systems for military applications. However, in 
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the last three decades, we have seen the widespread use of spread spectrum signals in 
commercial applications, especially in mobile cellular communications, in multiple access 
communications via satellites, and in interoffice radio communications. 

A historical account on the development of spread specwum communications cov- 
ering the period 1920-1960 is given in the paper by Scholtz (1982). Tutorial treatments 
of spread spectrum signals that deal with basic concepts are found in papers by Scholtz 
(1977) and Pickholtz, et al. (1982). These papers also contain a large number of references 
to the previous work. Two tutorial papers by Viterbi (1979, 1985) contain a basic analysis 
of the performance characteristics of DS and FH spread-spectrum signals. 

Comprehensive treatments of various aspects concerning the analysis and design of 
spread-spectrum signals and systems, including synchronization techniques, are found in 
books by Simon et al. (1985), Ziemer and Peterson (1985), and Holmes (1982). Also, 
special issues of the IEEE Transactions on Communications (August 1977, May 1982) are 
devoted to spread-spectrum communications. These special issues contain a collection of 
papers devoted to a variety of topics, including multiple-access techniques, synchronization 
techniques, and performance analysis with various types of channel interference. A number 
of important papers that have been published in IEEE journals have been reprinted in book 
form by the IEEE press. [See Dixon (1976) and Cook et al. (1983).] 


PROBLEMS 


15.1 Demonstrate that a DS spread spectrum signal without coding provides no improve- 
ment in performance against additive white Gaussian noise. 


15.2 A total of 30 equal-power users are to share a common communication channel by 
CDMA. Each user transmits information at arate of 10 kbps via DS spread spectrum 
and binary PSK. Determine the minimum chip rate needed to obtain a bit error prob- 
ability of 1075. Additive noise at the receiver may be ignored in this computation. 


15.3 A CDMA system is designed based on DS spread spectrum with a processing gain 
of 1000 and binary PSK modulation. Determine the number of users, if each user 
has equal power and the desired level of performance is an error probability of 1076. 
Repeat the computation if the processing gain is changed to 500. 


15.4 A DS spread spectrum system transmits at a rate of 1000 bps in the presence of a 
tone interference. The interference power is 20 dB greater than the desired signal, 
and the required ,/J, to achieve satisfactory performance is 10 dB. Determine the 
spreading bandwidth required to meet the specifications. 


15.5 A DS spread spectrum system is used to resolve the multipath signal component in a 
two-path radio signal propagation scenario. If the pathlength of the secondary path is 
300 m longer than that of the direct path, determine the minimum chip rate necessary 
to resolve the multipath component. 
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A CDMA system consists of 15 equal-power users who transmit information at a 
rate of 10,000 bps each. They use a DS spread spectrum signal operating at a chip 
rate of 1 MHz. The modulation is binary PSK. 


1. Determine the @,/Jo, where J is the spectral density of the combined interfer- 
ence. 


2. What is the processing gain? 


3. How much should the processing gain be increased to allow for doubling the 
number of users without affecting the output SNR? 


An FH binary orthogonal FSK system employs an m = 15 stage linear feedback 
shift register that generates a maximal length sequence. Each state of the shift reg- 
ister selects one of N nonoverlapping frequency bands in the hopping pattem. The 
bit rate is 100 bits/sec and the hop rate is once per bit. The demodulator employs 
noncoherent detection. 


1. Determine the hopping bandwidth for this channel. 
2. What is the processing gain? 
3. What is the probability of error in the presence of AWGN? 


A DS binary PSK spread spectrum system has a processing gain of 500. What is 
the interference margin against a continuous tone interference if the desired error 
probability is 1075? 


Suppose that {c,;} and{c2;} are two binary (0, 1) periodic sequences with periods Lı 
and L3, respectively. Determine the period of the sequence obtained by forming the 
modulo 2 sum of {c;;} and{c2;}. 


Anm = 10 maximum-length shift register generates the pseudorandom sequence in 
a DS spread spectrum system. The chip duration is Te = lusec and the bit duration 
is T, = LT,, where L is the length (period) of the m-sequence. 


1. Determine the processing gain of the system in dB. 


2. Determine the interference margin if the required @,/Jo = 10 and the interfer- 
ence is a tone interference with an average power Pyy. 


Figure P-15.11 illustrates the average power multipath delay profile in cellular com- 
munication for (a) suburban and urban areas and (b) hilly terrain area. In a GSM 
system, the bit rate is 270.8 kbps; in an IS-95 system (forward link), the bit rate 
is 19.2 kbps. Determine the number of bits affected by intersymbol interference 
in the transmission of the signal through the channels (a) and (b) for each cellular 
system. 
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Figure P-15.11 


15.12 For the multipath delay profiles shown in Figure P-15.11, determine the number of 
taps in a RAKE demodulator for the IS-95 forward link that would be needed to span 
the multipath for channels (a) and (b). Of the total number of RAKE taps, how many 
will contain signal components and how many will have no signal for channel (b)? 


15.13 A widely used model for the Doppler power spectrum of a mobile radio channel is 
the so-called Jakes’ model, given as 


1 1 
— c, |fl fa 
S(f) = | Thm V1- F/ fm)? 
0, otherwise 


where fm = ufo/c is the maximum Doppler frequency, v is the vehicle speed in m/s, 
fo is the carrier frequency, and c is the speed of light (3 x 108 m/sec). Determine fn 
for an automobile traveling at 100 km/hour and for a train traveling at 200 km/hour. 


Plot S( f) for the two vehicles for a cellular communication system with a carrier 
frequency of 900 MHz. 
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15.14 In Example 15.3.1, suppose that the DS spread spectrum signal is transmitted via 


15.15 


15.16 


15.17 


15.18 


radio to a receiver at a distance of 2000 km. The transmitter antenna has a gain 
of 20 dB, while the receiver antenna is omnidirectional. The carrier frequency is 3 
MHz, the available channel bandwidth W = 10° Hz, and the receiver has a noise 
temperature of 300° K. Determine the required transmitter power and the bit rate of 
the DS spread spectrum system. 


A rate 1/2 convolutional code with dfree = 10 is used to encode a data sequence 
occurring at a rate of 1000 bits/sec. The modulation is binary PSK. The DS spread 
spectrum sequence has a chip rate of 10 MHz. 


1. Determine the coding gain. 
2. Determine the processing gain. 


3. Determine the interference margin assuming an €,/J, = 10. 


Consider the FH binary orthogonal FSK system described in Problem 15.7. Sup- 
pose that the hop rate is increased to two hops/bit. The receiver uses square-law 
combining to combine the signal over the two hops. 


1. Determine the hopping bandwidth for the channel. 
2. What is the processing gain? 
3. What is the error probability in the presence of AWGN? 


In a fast FH spread-spectrum system, the information is transmitted via FSK, with 
noncoherent detection. Suppose there are N = 3 hops/bit, with hard-decision decod- 
ing of the signal in each hop. 


1. Determine the probability of error for this system in an AWGN channel with 
power-spectral density So and an SNR=13 dB (total SNR over the three 
hops). 


2. Compare the result in (1) with the error probability of an FH spread-spectrum 
system that hops once/bit. 


A slow FH binary FSK system with noncoherent detection operates at an 
Eb/lo = 10, with a hopping bandwidth of 2 GHz, and a bit rate of 10 Kbps. 


1. What is the processing gain for the system? 


2. In the case of a partial-band interference, what is the bandwidth occupancy for 
worst-case performance? K 


3. What is the probability of error for the worst-case partial-band interference? 
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COMPUTER PROBLEMS 


15.1 Simulation of Direct Sequence Spread Spectrum Signals 


The objective of this problem is to demonstrate the effectiveness of a DS spread 
spectrum signal in suppressing sinusoidal interference via Monte Carlo simulation. 
The block diagram of the system to be simulated is illustrated in Figure CP-15.1. 
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Figure CP-15.1 


A uniform random number generator (RNG) generates a sequence of binary informa- 
tion symbols (+1). Each information bit is repeated Le times, where Le corresponds 
to the number of PN chips per information bit. The resulting sequence, which con- 
tains L, repetitions per bit, is multiplied by a PN sequence c(n) generated by another 
uniform RNG. To this product sequence, we add white Gaussian noise with variance 
o? = No /2 and sinusoidal interference of the form 


i(n) = A sin won, 


where 0 < wo < v and the amplitude of the sinusoid is selected to satisfy A < Le. 
The demodulator performs the cross correlation with the PN sequence and sums 
(integrates) the blocks of Le signal samples that constitute each information bit. The 
output of the summer is fed to the detector, which compares this signal with the 
threshold of zero and decides whether the transmitted bit is +1 or —1. The error 
counter tallies the number of errors made by the detector. Plot the measured error 
probability as a function of the SNR for three different values of the amplitude A = 
0, A = 3, and A = 10 of the sinusoidal interference with Le = 20. The SNR is 
defined as €,/No and can be varied by setting €, = 1 and scaling the variance o? of 
the additive Gaussian noise. 
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Simulation of Synchronous CDMA System 


Write a MATLAB program that performs a Monte Carlo simulation of four time- 
synchronous CDMA users when each user employs a distinct Gold sequence of 
length L = 31. The four Gold sequences are as follows: 


i1i001100000010011110011001111011 
1000101010100011001010100000110 
1010111111000010111001111111100 
1110010100000001011111000001000 


The users employ the binary (+1) modulation of their representative Gold sequences 
with the number of chips per bit equal to 31. The receiver for each user correlates the 
composite CDMA received signal, which is corrupted by additive white Gaussian 
noise (added on a chip-by-chip basis) with their respective sequence. Using Monte 
Carlo simulation with N = 10,000 information bits, estimate and plot the probabil- 
ity of error for each user as a function of SNR. 


Simulation of Asynchronous CDMA system 


Repeat Problem 15.2 when the four users transmit asynchronously. For example, 
simulate the case when the four-user CDMA signals are offset in time by one chip 
relative to one of the other signals. That is, the CDMA signal of user 2 is delayed 
by one chip relative to the first user, the CDMA signal of user 3 is delayed by one 
chip relative to the signal of user 2, and the CDMA signal of user 4 is delayed by 
one chip relative to the signal of user 3. Compare the error probability obtained with 
asynchronous transmission to that obtained with synchronous transmission. 


Generation of a Maximal Length Shift Register Sequence 


Write a MATLAB program thatimplements an m = 12-stage maximum-length shift 
register and generate three periods of the sequence. Compute and graph the peri- 
odic autocorrelation function of the equivalent bipolar sequence given by Equation 
(15.4.1). 


Detection of an LPI Signal 


An LPI signal is generated by binary modulation of an m-sequence from a 10-stage 
shift register, (L = 1023) and, hence, the number of chips per information bit is 
1023. The output of the shift register is mapped into the bipolar sequence 


es 1 ifthe shift register output is a 1 
«~~ )-1. ifthe shift register output is a 0 


The transmitted sequence of chips is corrupted by AWGN, so the received signal 
sequence at the output of the chip matched filter is 


re=SChtny, k=1,2,...,1023 
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‘where the binary data bit s is either +1 or —1, forthe entire sequence 0 < k < 1023. 


1. Generate the m-sequence {c} and verify Equation (15.4.2) is satisfied. 


2. Use the m-sequence generated in part 1 to construct the received signal sequence 
{r,} and plot it for k = 1, 2, ..., 1023, when the variance of the Gaussian noise 
samples is a? = 10. Is the transmitted signal sequence x, = scx, 1 < k < 1023 
discernable in {rg}? 


3. Compute the cross correlation of {r} with {c,} and plotthe result 


n 


Yn =) reck, n=1,2,...,1023 
k=) 


Comment on the result of this correlator output. 


Simulation of Frequency Hopped FSK 


An FH binary orthogonal FSK system employs an m = 7-state shift register to gener- 
ate a periodic maximum-length sequence of length L = 127. Each stage of the shift 
register selects one of N = 127 nonoverlapping frequency bands in the hopping pat- 
tem. Write a MATLAB program that simulates the selection of the center frequency 
and the generation of the two frequencies in each of the N = 127 frequency bands. 
Show the frequency selection pattem for the first 10 bit intervals. 


Simulation of an FSK System with Partial Band Interference 


Via Monte Carlo simulation, demonstrate the performance of an FH digital com- 
munication system that employs binary FSK and is corrupted by worst-case partial- 
band interference. The block diagram of the system to be simulated is shown in 
Figure CP-15.7. A uniform random number generator (RNG) is used to generate a 
binary information sequence, which is the input to the FSK modulator. The output 
of the FSK modulator is corrupted by additive Gaussian noise with probability £, 
where 0 < 6 < 1. A second uniform RNG is used to determine when the additive 
Gaussian noise corrupts the signal and when it does not. In the presence of noise, the 
input to the detector, assuming that a 0 is transmitted, is 


r= (Vē coso + ne) + (Ve sind + ns) 


2 2 

2 = Ne t Nis 
where ¢ represents the channel phase shift and 71c, 115, n2c, nas represent the additive 
noise components. In the absence of noise, we have 


ri =p, rn =0 


and, hence, no errors occur at the detector. The variance of each of the noise com- 
ponents is o? = Jo/2f, where £ is given by Equation (15.5.4). For simplicity, we 
may set ¢ = 0 and normalize Jo by setting it equal to unity. Then pp = €,/Jo = Sp. 
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Figure CP-15.7 


Because o? = Jo /2B and B = 2/pp, it follows that, in the presence of partial-band 
interference, o? = $,/4 and B = 2/€p, where , is constrained to 8, > 2. Perform 
the simulation for 10,000 data bits for the signal energies in the range of 5 dB to 
25 dB, and plot the error probability. 
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catastrophic, 716-717 
compared to block codes, 694 
decoding algorithms for, 722 
with good distance properties, 725 
MAP decoding, 731-737 
maximum likelihood decoding of, 
717-722 
parallel concatenated (PCCC), 729 
rate of, 694 
state-wansition diagram, 712 
trellis diagram, 712-713 
Convolutional encoder: 
block diagram of, 711-712 
recursive systematic, 729 
Convolution integral, 42 
Correlation, 204 
Correlation coefficient, 205 
Correlation metrics, 394 
Correlation-type demodulator, 362-371 
for binary antipodal signals, 362-365 
for binary orthogonal signals, 365-371 
Coset, 703 
Coset leader, 706 
Costas loop, 437—439, 441, 855 
block diagram of, 437 
Covariance, 204 
CPFSK, See Continuous-phase FSK 
(CPFSK) 
CPM pulse shapes, 523 
Cross correlation, between random 
processes, 217-218 
Crossover probability, 663 
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Multiple random processes, 217-218 
Multiple random variables, 203—208 
multiple functions of, 206-208 
Multiple receiving antennas, using 
with only one transmitting antenna, 
786 
Multiplexing, 326 
defined, 144 
Mutual information, 650 


N 


Nakagami distribution, 789 

Narrowband angle modulation, 165 

National Television Systems Committee 
(NTSC), 643 

Natural binary coding (NBC), 312 

codes fora 16-level quantization, 312 

NBC coding, 312 

Noise equivalent bandwidth, 234 

Noise figure, 282 

Noisy channel coding theorem, 642, 664, 
667-668 

Noncausal signals, 27 

Noncausal systems, 41 

Noncoherent demodulation and detection, 
504 


Index 


Nonlinear quantization, 315 
Nonlinear systems, 39—40 
Nonperiodic signals, 26-27 
Nonuniform PCM, 315-316 
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Quantization noise, 304 
power, 304 
Quantization region, 302 
Quantization table for JPEG, 334 
Quantized values, 306 
Quaternary CPFSK, phase trajectory for, 
515, 517 
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RAKE demodulator, 839 
defined, 793 
and frequency selective channels, 
792-7194 
Random errors, 709 
Random experiment, 190-191 
Random processes, 190-254 
autocorrelation function, 214 
and linear systems, 218-220 
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multiple, 217-218 
statistical averages, 212-215 
wide-sense stationary random 
processes, 215—217 
Random signals, 26, 209 
Random variables, 190—209 
Bernoulli, 196, 198, 203 
binomial, 196-197, 203 
binormal, 208 
continuous, 194 
discrete, 194 
functions of, 201-203 
Statistical averages, 202—203 
Gaussian, 197—201, 203 
i.i.d. (independent and identically 
distributed), 209 
jointly Gaussian, 208 
mixed, 195—196 
multiple, 203—208 
multiple functions of, 206-208 
normal, 197—201 
standard normal, 197 
statistically dependent, 205 
statistically independent, 204 
sums of, 208—209 
uncorrelated, 205 
uniform, 197, 217 
Rate: 
code, 643 
entropy, 648-649 
Nyquist, 10, 643 
spectral bit, 529, 672 
Rayleigh fading channels, 776 
performance of binary modulation in, 
783-795 
Rayleigh probability density function, 
208 
Rayleigh’s theorem, 72 
Reactance tube, 172 
Real signals, 24 
Fourier series for, 49-52 
Realization of the random process, 211 
Receiving filters, for a known channel, 
design of, 578-582 
Rectangular pulse, 31-32 
Recursive systematic convolutional codes 
(RSCC), 729 
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Reed-Solomon codes, 708, 728, 757 
Regenerative repeaters, 456-457 
Regular LDPC codes, 744 
Reliable communications, coding for, 
689-768 
Remez algorithm, 86 
Repeaters, 284—287 
analog, 284 
regenerative, 456-457 
Residual pulse-excited (RPE) linear 
predictive coding (RPELPC), 859 
Rician distribution, 242, 789 
Ring modulator, 140 
Roll-off factor, 562 
RPELPC, See Residual pulse-excited 
(RPE) linear predictive coding 
(RPELPC) 
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Sample space, 191 
Sampling theorem, 297—301 
Samuel Morse, 1 
Scalar quantization, 302-309 
nonuniform quantizers, 305-309 
uniform quantizers, 305 
Second generation cellular radio systems, 
866 
Second-order properties, 208 
Self information, 644 
Self-synchronizing code, 655 
Sequential decoding, 722 
Shannon, Claude E., 10-11, 458, 652, 
654, 664, 675 
Shannon limit, 492, 530, 533 
Shockley, William, 4 
Sifting property of impulse signal, 35 
Sigma-delta modulator (SDM), 331 
Signal diversity, performance 
improvement through, 786-792 
Signal fading, 16, 769, 773 
Signal multiplexing, 144-146 
frequency-division multiplexing, 
144-145 
quadrature-carrier multiplexing, 
145-146 
Signal power, 304 
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Signal transmission, intersymbol 
interference in, 547-549 
Signal waveforms, geometric 
representation of, 348—352 
Signals: 
anticausal, 27 
baseband pulse, 520 
basic concepts, 21--43 
basic operations on, 21—23 
binary antipodal, 352-355, 362-365, 
371-373, 379 
binary orthogonal, 365-371, 373-374 
causal, 27 
classification of, 23-31 
complex, 24—26 
Hermitian symmetry for, 29 
complex exponential, 31 
continuous-time, 23—24 
delta, 35-38 
deterministic, 26 
discrete-time, 23-24 
discrete-time continuous-valued, 296 
energy-type, 29-31, 89-92 
even, 27—29 
Hermitian, 29 
impulse, 35-38 
information-bearing, 21 
and linear systems, 21-116 
lowpass, 82, 98-100 
noncausal, 27 
nonperiodic, 26-27 
odd, 27-29 
orthogonal, 488--491 
partial response, 521 
quadrature amplitude-modulated, 
419-429 
PSK, geometric representation of, 
408--410 
periodic, 26-27 
power-type, 29-31, 89, 92-95 
random, 26, 209 
real, 24—26 
Fourier series for, 49-52 
rectangular pulse, 31—32 
signum, 34 
sinc, 33 
sinusoidal, 23, 31 
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speech, 321-322 
spread-spectrum, 825-826 
triangular, 32—33 
unit-step, 31, 36 
video, 8, 117, 134 
Signal-to-noise ratio (SNR), 255, 382 
Signal-to-quantization noise ratio 
(SQNR), 314 
Signum signal, 34 
Sinc signal, 33 
Single-sideband AM signals, 132—134 
demodulation of, 133-134 
derivation of the expression for, 
149-150 
effect of noise on, 258—259 
Sinusoidal pulse shape, 519 
Sinusoidal signal, 23, 31 
SISO decoder, See Soft-input-soft-output 
(SISO) decoder 
Sky-wave propagation, 14-16 
Slope overload distortion, 320 
Slowly fading, frequency nonselective 
channel, 776 
SNR/bit, 382 
Soft-decision decoding, 700—702, 707 
defined, 701 
Soft-input-soft-output (SISO) decoder, 
735 
Soft-output Viterbi algorithm (SOVA), 
731 
Source coding theorem, 652—654 
Space-time block codes (STBC), 
802-803 
Space-time codes, 802, 808 
Space-time trellis codes, 808-810 
Spectral bit rate, 529 
Spectral characteristics of OFDM signals, 
629-631 
Spectrum despreading, 830 
Speech generation mechanism, model for, 
322 
Speech signals, 321-322 
Spread-spectrum communication 
systems, 825-876 
frequency-hopped (FH) spread 
spectrum, 843-849 
model of, 826-827 
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Spread-spectrum signals, 825-826 
Squared error distortion, 303 
Square-law combiner, 787 
Square-law detector, 509 
Stack algorithm, 722 
Staggered quadrature PSK (SQPSK), 
519 
Standard array, 703 
Standard deviation, 197, 203 
Standard normal random variable, 197 
State diagram, 515 
State trellis, 515 
State-transition diagram, 712 
Stationary processes, power spectral 
density of, 220-225 
Statistical averages, 212-215 
Statistically dependent random variables, 
205 
Statistically independent random 
variables, 204 
STBC, See Space-time block codes 
(STBC) 
Stochastic gradient algorithm, 591 
Storage channels, 17 
Subscriber local loop, 633 
Subset decoding, 756 
Sum process, power spectral density of, 
225-226 
Supergroup channel, 145 
Superheterodyne AM radio receiver, 3 
Superheterodyne receiver, 147 
Suppressed-carrier signal, 121 
Survivor path, 719 
Switching modulator, 138-140 
Symbol-by-symbol detection of data with 
controlled ISI, 567—570 
Symbol-spaced equalizer, 586 
Symbol synchronization, 446-456 
for carrier-modulated signals, 455-456 
early-late gate synchronizers, 447-449 
maximum likelihood method, 451--452 
minimum mean square error method, 
450-451 
spectral-line method, 452-455 
Symmetric channel, binary, 663 
Synchronous demodulator, 126, 257 
Systematic code, 697 
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Systems: 

causal, 41 

classification of, 38-41 

continuous-time, 39 

discrete-time, 39 

linear, 39-40 

linear time-invariant (LTI), 40, 41--42 
analysis in the time domain, 41 
response to periodic signals, 54-56 

noncausal, 41 

nonlinear, 39--40 

time-invariant, 40-41 

time-varying, 40-41 
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Tail probability in a Gaussian random 
variable, 200 
Tanner graph, 742 
for low-density parity-check codes, 
743 
Tapped delay line channel model, 
711-7718 
TCM, See Trellis-coded modulation 
(TCM) 
TDM, See Time-division multiplexing 
(TDM) 
TDMA, See Time-division multiple 
access (TDMA) 
Telephone-line modems, coding for, 
758-759 
Television broadcasting, 296, 642 
Telstar I, 4 
Thermal noise, 6, 12, 226 
filtered, 231 
properties of, 230 
sources, characterization of, 279—280 
Third generation cellular radio systems, 
866 
3-db bandwidth, 83 
Threshold effect, 270, 271 
Threshold SNR, 272 
Time diversity, 786 
Time reversal, 22 
Time scaling, of a signal, 22-23 
Time shifting, 21 
Time-average autocorrelation function, 92 
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Time-division multiple access (TDMA), 
858 
Time-division multiplexing (TDM), 326 
Time-invariant systems, 40—41 
Time-variant channels, examples of, 582 
Time-varying systems, 40-41 
Timing recovery method for QAM, block 
diagram of, 455 
Toeplitz matrix, 324 
Total probability theorem, 192 
Transducer, 5 
Transform-coding techniques, 332 
Transmission losses, 283—284 
Transmitting filters, for a known channel, 
design of, 578-581 
Transversal filter, 585, 586 
Trellis: 
phase, 515 
state, 515 
Trellis-coded modulation (TCM), 747, 
749-156 
coded modulation, 750-755 
decoding TCM codes, 755-756 
set partitioning principles, 750 
Trellis diagram, 712-713 
Triangular signal, 32-33 
Trigonometric Fourier-series expansion, 
50 
Turbo codes, 727, 728-741, 757, 760 
iterative decoding, 737 
max-log-APP algorithm, 737 
multiplicity, 729, 741 
performance of, 739-741 
Turbo decoding, 729 
algorithm, 738 
Turbo encoder, block diagram of, 729 
Twisted-pair wireline channels, 12 
Two-sided power spectral density, 230 
Typical sequences, 652 
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UMTS, 867 

Uncorrelated random variables, 205 
Underwater acoustic channels, 17, 543 
Ungerboeck codes, 759 

Ungerboeck encoder, 755 
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Uniform quantizers, 305 

Uniform random variable, 197, 203 
Union bound, 396—398 

Uniquely decodable code, 656 

Unit-step signals, 27, 31 

Universal source coding algorithms, 660 
Unvoiced speech, 321 

Upper sideband, 120 
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Varactor diode, 171 

Variable-to-fixed length coding scheme, 
660 

Variance, 197, 203 

Vector quantization, 309-311 

Vestigial-sideband (VSB) AM, 134-137 

Video signal, 134 

Viterbi, Andrew, 574 

Viterbi algorithm (VA), 574-575, 583, 
598, 600, 717-722, 728, 731, 734, 
750, 756-757, 760, 809, 860 

Viterbi decoder, 767 

Voiced speech, 321 

Volta, Alessandro, 1 

Voltage spectrum, 59 

Voltage-controlled oscillator (VCO), 171, 
430 
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Walsh sequence, See Hadamard sequence 
Waveform coding, 312-321 
delta modulation (DM), 318-321 
differential pulse code modulation 
(DPCM), 317-318 
pulse code modulation (PCM), 313—317 
nonuniform PCM, 315-317 
uniform PCM, 313-315 
Weight of a codeword, 695 
White processes, 228—230 
Wideband CDMA (WCDMA), 867 
Wide-sense stationary random processes, 
215-217 
Wiener, 10 
Wiener-Khinchin theorem, 221—223 
Wireless communications, 3-4 
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Wireless LANs, 635 Z 
configuration, 635 

Wireline channels, 12-18, 456 
fiber optic channels, 13—14 
storage channels, 17—18 

Wozencraft, John M., 11 


Zero-forcing equalizer, 584—585 

Zero-mean complex-valued Gaussian 
random process, 776 

Zero-mean uncorrelated Gaussian random 
variables, 388 

Y Zero-mean unit-variance Gaussian 

random variable, 305 


Yule-Walker equations, 324 Zworykin, V. K., 4 
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